Abstract

In this paper, we use the Markov property introduced in Balan and Ivanoff (2002) for set-indexed processes and we prove that a Markov prior distribution leads to a Markov posterior distribution. In particular, by proving that a neutral to the right prior distribution leads to a neutral to the right posterior distribution, we extend a fundamental result of Doksum (1974) to arbitrary sample spaces.

1 Introduction

Bayesian nonparametric statistics is a field that was introduced by Ferguson in 1973 and has become increasingly popular among theoretical statisticians in the past few decades. The philosophy behind this field is to assume that the common (unknown) distribution $P$ of a given sample $X = (X_1, \ldots, X_n)$ is also governed by randomness, and therefore can be regarded as a stochastic process (indexed by sets). The best way for a Bayesian statistician to guess the "shape" of the prior distribution of $P$ is to identify the posterior distribution of $P$ given $X$ and to prove that it satisfies the same properties as the prior.

Formalizing these ideas, we can say that a typical problem in Bayesian nonparametric statistics is to identify a class $\Sigma$ of "random distributions" $P$ such that if $X$ is a sample of $n$ observations drawn according to $P$, then the posterior distribution of $P$ given $X$ remains in the class $\Sigma$. The purpose of this paper is to introduce a new class $\Sigma$ for which this property is preserved. This is the class of Q-Markov processes (or distributions), which contains the extensively studied class of neutral to the right processes.

There are two major contributions in the literature in this field. The first one is Ferguson's (1973) fundamental paper, where it is shown that the posterior distribution of a Dirichlet process is also Dirichlet. (By definition, a Dirichlet process with parameter measure $\alpha$ has a Dirichlet finite dimensional distribution with parameters $\alpha(A_1), \ldots, \alpha(A_k), \alpha((\cup_{i=1}^{k} A_i)^c)$ over any disjoint sets $A_1, \ldots, A_k \in \mathcal{B}$.) The second one is Doksum's (1974) fundamental paper, where it is proved that if $\mathcal{X} = \mathbf{R}$, then the posterior distribution of a neutral to the right process is also neutral to the right. (A random probability distribution function $F := (F_t)_{t \in \mathbf{R}}$ is neutral to the right if $F_{t_1}, (F_{t_2} - F_{t_1})/(1 - F_{t_1}), \ldots, (F_{t_k} - F_{t_{k-1}})/(1 - F_{t_{k-1}})$ are independent $\forall t_1 < \ldots < t_k$, or equivalently, $Y_t := -\ln(1 - F_t), t \in \mathbf{R}$ is a process with independent increments.) A quick review of the literature to date (Ferguson, 1974; Ferguson and Phadia, 1979; Dykstra and Laud, 1981; Hjort, 1990; Walker and Muliere, 1997; Walker and Muliere, 1999) reveals that neutral to the right processes have received considerable attention in the past three decades, especially because of their appealing representation using Lévy processes and because of their applications in survival analysis, reliability theory, and life history data.

In the present paper we extend Doksum's result to the class of Q-Markov processes introduced in Balan and Ivanoff (2002), which are characterized by Markov-type finite dimensional distributions. Unlike Doksum's paper (and unlike most of the statistical papers generated by it), our results are valid for arbitrary sample spaces $\mathcal{X}$ which can be endowed with a certain topological structure (in particular for $\mathcal{X} = \mathbf{R}^d$). Our main result (Theorem 3.4) proves that if $P := (P_A)_{A \in \mathcal{B}}$ is a set-Markov random probability measure and $X_1, \ldots, X_n$ is a sample from $P$, then the conditional distribution of $P$ given $X_1, \ldots, X_n$ is also set-Markov. This result is new even in the case $\mathcal{X} = \mathbf{R}$, when the set-Markov property coincides with the classical Markov property.

The paper is organized as follows: 

In Section 2 we describe the structure that has to be imposed on the sample space $\mathcal{X}$ (which will be assumed for the entire paper); under this structure we identify the necessary ingredients for the construction of set-Markov (respectively Q-Markov) random probability measures.

In Section 3 we introduce the Bayesian nonparametric framework and we 
prove that a set-Markov prior distribution leads to a set-Markov posterior dis- 
tribution. The essence of all calculations is an integral form of Bayes' formula. 

In Section 4 we define neutral to the right processes and using their Q- 
Markov property we prove that a neutral to the right prior distribution leads to 
a neutral to the right posterior distribution. 

The paper also includes two appendices: Appendix A contains two elementary results which are used for the proof of Theorem 3.4; Appendix B contains a Bayes property of a classical Markov chain, which is interesting by itself and which has motivated this paper.

2 Q-Markov random probability measures 

Let $(\mathcal{X}, \mathcal{B})$ be an arbitrary measurable space (the sample space).

Definition 2.1 A collection $P := (P_A)_{A \in \mathcal{B}}$ of $[0,1]$-valued random variables is called a random probability measure if

(i) it is finitely additive in distribution, i.e., for every disjoint sets $(A_j)_{j=1,\ldots,k}$ and for every $1 \le i_1 < \ldots < i_n \le k$, the distribution of $(P_{\cup_{j=1}^{i_1} A_j}, \ldots, P_{\cup_{j=i_{n-1}+1}^{i_n} A_j})$ coincides with the distribution of $(\sum_{j=1}^{i_1} P_{A_j}, \ldots, \sum_{j=i_{n-1}+1}^{i_n} P_{A_j})$;

(ii) $P_{\mathcal{X}} = 1$ a.s.; and

(iii) it is countably additive in distribution, i.e., for every decreasing sequence $(A_n)_n \subseteq \mathcal{B}$ with $\cap_n A_n = \emptyset$ we have $\lim_n P_{A_n} = 0$ a.s.

Note that the almost sure convergence in (iii) is equivalent to convergence in distribution and to convergence in mean.

In order to construct a random probability measure $P$ on $\mathcal{B}$ it is enough to specify its finite dimensional distributions $\mu_{A_1 \ldots A_k}$ over all unordered collections $\{A_1, \ldots, A_k\}$ of disjoint sets in $\mathcal{B}$. Some conditions need to be imposed.

Condition C1. If $\{A_1, \ldots, A_k\}$ is an unordered collection of disjoint sets and we let $A'_l := \cup_{j=i_{l-1}+1}^{i_l} A_j;\ l = 1, \ldots, m$ for $1 \le i_1 < \ldots < i_m \le k$ (with $i_0 := 0$), then

\[ \mu_{A'_1 \ldots A'_m} = \mu_{A_1 \ldots A_k} \circ \alpha^{-1}, \quad \text{where } \alpha(x_1, \ldots, x_k) = \Big(\sum_{j=1}^{i_1} x_j, \ldots, \sum_{j=i_{m-1}+1}^{i_m} x_j\Big). \]

Condition C2. For every $(A_n)_n \subseteq \mathcal{B}$ with $A_{n+1} \subseteq A_n, \forall n$ and $\cap_n A_n = \emptyset$, we have $\lim_n \mu_{A_n} = \delta_0$.
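The aggregation map $\alpha$ of Condition C1 can be sketched in a few lines of Python. This is only an illustration: the function name `aggregate`, the list representation of $(x_1, \ldots, x_k)$ and the numerical masses are assumptions made here, not notation from the paper.

```python
from fractions import Fraction

def aggregate(x, cuts):
    """Map (x_1, ..., x_k) to the block sums used in Condition C1.

    `cuts` holds the indices i_1 < ... < i_m (1-based, i_m <= len(x));
    block l sums x_j for j = i_{l-1}+1, ..., i_l, with i_0 = 0.
    """
    out, start = [], 0
    for end in cuts:
        out.append(sum(x[start:end], Fraction(0)))
        start = end
    return out

# Hypothetical masses of four disjoint sets A_1..A_4,
# merged into A'_1 = A_1 and A'_2 = A_2 u A_3 (i_1 = 1, i_2 = 3).
masses = [Fraction(1, 10), Fraction(2, 10), Fraction(3, 10), Fraction(4, 10)]
merged = aggregate(masses, [1, 3])
```

Condition C1 then says that pushing $\mu_{A_1 \ldots A_k}$ forward through this map must give the distribution prescribed for the merged sets.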

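In this paper we will assume that the sample space $\mathcal{X}$ has an additional underlying structure, which we now describe.

Let $\mathcal{X}$ be a (Hausdorff) topological space and $\mathcal{B}$ its Borel $\sigma$-field. We will assume that there exists a collection $\mathcal{A}$ of closed subsets of $\mathcal{X}$ which generates $\mathcal{B}$ (i.e. $\mathcal{B} = \sigma(\mathcal{A})$) and which has the following properties:

1. $\emptyset, \mathcal{X} \in \mathcal{A}$;

2. $\mathcal{A}$ is a semilattice, i.e., $\mathcal{A}$ is closed under arbitrary intersections;

3. $\forall A, B \in \mathcal{A};\ A, B \neq \emptyset \Rightarrow A \cap B \neq \emptyset$;

4. There exists a sequence $(\mathcal{A}_n)_n$ of finite sub-semilattices of $\mathcal{A}$ such that $\forall A \in \mathcal{A}$, there exist $A_n \in \mathcal{A}_n(u), \forall n$ with $A = \cap_n A_n$ and $A \subseteq A_n^{\circ}, \forall n$. (Here $\mathcal{A}_n(u)$ denotes the class of all finite unions of sets in $\mathcal{A}_n$.)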






More details about this type of structure can be found in Ivanoff and Merzbach (2000), where $\mathcal{A}$ is called an indexing collection. By properties 2 and 3, the collection $\mathcal{A}$ has the finite intersection property, and hence its minimal set $\emptyset' := \cap_{A \in \mathcal{A} \setminus \{\emptyset\}} A$ is non-empty.

The typical example of a sample space $\mathcal{X}$ which can be endowed with an indexing collection is $\mathbf{R}^d$; in this case $\mathcal{A} = \{[0, z];\ z \in \mathbf{R}^d\} \cup \{\emptyset, \mathbf{R}^d\}$ and the approximation sets $A_n$ have vertices with dyadic coordinates.
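For the rectangle example, the semilattice property and the dyadic approximation can be illustrated as follows. This is a sketch under simplifying assumptions that are not in the paper: only rectangles $[0, z]$ with nonnegative coordinates are handled, a rectangle is represented by its upper corner, and `dyadic_upper` is a hypothetical helper that picks one dyadic rectangle with a strictly larger corner.

```python
import math

def intersect(z1, z2):
    """[0, z1] intersected with [0, z2] is [0, min(z1, z2)], coordinatewise."""
    return tuple(min(a, b) for a, b in zip(z1, z2))

def dyadic_upper(z, n):
    """Corner of a rectangle with dyadic vertices (mesh 2**-n) whose corner
    is strictly larger than z in every coordinate (assumes z >= 0)."""
    scale = 2 ** n
    return tuple((math.floor(c * scale) + 1) / scale for c in z)

z = (0.3, 0.7)
corners = [dyadic_upper(z, n) for n in range(1, 6)]
```

As `n` grows, the corners decrease towards `z`, which mirrors the requirement $A = \cap_n A_n$ in property 4.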

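We denote by $\mathcal{A}(u)$ the class of all finite unions of sets in $\mathcal{A}$, by $\mathcal{C}$ the semialgebra of the sets $C = A \setminus B$ with $A \in \mathcal{A}, B \in \mathcal{A}(u)$, and by $\mathcal{C}(u)$ the algebra of sets generated by $\mathcal{C}$. Note that $\mathcal{B} = \sigma(\mathcal{C}(u))$.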

We now introduce the definition of the Q-Markov property. This definition was originally considered in Balan and Ivanoff (2002) for finitely additive real-valued processes indexed by the algebra $\mathcal{C}(u)$. In this paper, we will restrict our attention to random probability measures.

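Definition 2.2 (a) For each $B_1, B_2 \in \mathcal{A}(u)$ with $B_1 \subseteq B_2$, let $Q_{B_1B_2}$ be a transition probability on $[0,1]$. The family $Q := (Q_{B_1B_2})_{B_1 \subseteq B_2}$ is called a transition system if $\forall B_1 \subseteq B_2 \subseteq B_3$ in $\mathcal{A}(u)$, $\forall z_1 \in [0,1]$, $\forall \Gamma_3 \in \mathcal{B}([0,1])$

\[ Q_{B_1B_3}(z_1; \Gamma_3) = \int_{[0,1]} Q_{B_2B_3}(z_2; \Gamma_3)\, Q_{B_1B_2}(z_1; dz_2). \]

(b) Given a transition system $Q := (Q_{B_1B_2})_{B_1 \subseteq B_2}$, a random probability measure $P := (P_A)_{A \in \mathcal{B}}$, defined on a probability space $(\Omega, \mathcal{F}, \mathcal{P})$, is called Q-Markov if $\forall B_1 \subseteq B_2$ in $\mathcal{A}(u)$, $\forall \Gamma_2 \in \mathcal{B}([0,1])$

\[ \mathcal{P}[P_{B_2} \in \Gamma_2 \mid \mathcal{F}_{B_1}] = Q_{B_1B_2}(P_{B_1}; \Gamma_2) \quad \text{a.s.} \]

where $\mathcal{F}_{B_1} := \sigma(\{P_A;\ A \in \mathcal{A}, A \subseteq B_1\})$.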

A Q-Markov random probability measure can be constructed using the fol- 
lowing additional consistency condition. 

Condition C3. If $(Y_1, \ldots, Y_k)$ is a vector with distribution $\mu_{C_1 \ldots C_k}$, where $C_1 = B_1;\ C_i = B_i \setminus B_{i-1};\ i = 2, \ldots, k$ and $B_1 \subseteq \ldots \subseteq B_k$ are sets in $\mathcal{A}(u)$, then for every $i = 2, \ldots, k$, the distribution of $Y_i$ given $Y_1 = y_1, \ldots, Y_{i-1} = y_{i-1}$ depends only on $y := \sum_{j=1}^{i-1} y_j$ and is equal to $Q_{B_{i-1}B_i}(y; y + \cdot)$.

The next result follows immediately by Kolmogorov's extension theorem. 

Theorem 2.3 Let $Q := (Q_{B_1B_2})_{B_1 \subseteq B_2}$ be a transition system. For each unordered collection $\{A_1, \ldots, A_k\}$ of disjoint sets in $\mathcal{B}$ let $\mu_{A_1 \ldots A_k}$ be a probability measure on $([0,1]^k, \mathcal{B}([0,1])^k)$ such that C1-C3 hold; let $\mu_{\emptyset} = \delta_0, \mu_{\mathcal{X}} = \delta_1$. Then there exists a probability measure $\mathcal{P}^1$ on $([0,1]^{\mathcal{B}}, \mathcal{B}([0,1])^{\mathcal{B}})$ under which the coordinate-variable process $P := (P_A)_{A \in \mathcal{B}}$ is a Q-Markov random probability measure whose finite dimensional distributions are the measures $\mu_{A_1 \ldots A_k}$.

Examples:

1. Let $P$ be the Dirichlet process with parameter measure $\alpha$. For any disjoint sets $A_1, \ldots, A_k$ in $\mathcal{B}$, $(P_{A_1}, \ldots, P_{A_k})$ has a Dirichlet distribution with parameters $\alpha(A_1), \ldots, \alpha(A_k), \alpha((\cup_{j=1}^{k} A_j)^c)$. The ratio $P_{A_i}/(1 - \sum_{j=1}^{i-1} P_{A_j})$ is independent of $P_{A_1}, \ldots, P_{A_{i-1}}$ and has a Beta distribution with parameters $\alpha(A_i), \alpha((\cup_{j=1}^{i} A_j)^c)$; hence the distribution of $P_{A_i}$ given $P_{A_1}, \ldots, P_{A_{i-1}}$ depends only on $\sum_{j=1}^{i-1} P_{A_j}$. The process $P$ is Q-Markov with $Q_{B_1B_2}(z_1; \Gamma_2)$ equal to the value at $(\Gamma_2 - z_1)/(1 - z_1)$ of the Beta distribution with parameters $\alpha(B_2 \setminus B_1), \alpha(B_2^c)$.

2. Let $P := (1/N) \sum_{j=1}^{N} \delta_{Z_j}$ be the empirical measure of a sample $Z_1, \ldots, Z_N$ from a non-random distribution $P_0$ on $\mathcal{X}$. For any disjoint sets $A_1, \ldots, A_k$ in $\mathcal{B}$, $(NP_{A_1}, \ldots, NP_{A_k})$ has a multinomial distribution with $N$ trials and $P_0(A_1), \ldots, P_0(A_k)$ probabilities of success; hence the distribution of $NP_{A_i}$ given $NP_{A_1}, \ldots, NP_{A_{i-1}}$ depends only on $\sum_{j=1}^{i-1} P_{A_j}$ (it is a binomial distribution with $N(1 - \sum_{j=1}^{i-1} P_{A_j})$ trials and $P_0(A_i)/(1 - \sum_{j=1}^{i-1} P_0(A_j))$ probability of success). The process $P$ is Q-Markov with

\[ Q_{B_1B_2}\Big(\frac{m_1}{N}; \Big\{\frac{m_2}{N}\Big\}\Big) = \binom{N - m_1}{m_2 - m_1} \frac{P_0(C)^{m_2 - m_1}\, P_0(B_2^c)^{N - m_2}}{P_0(B_1^c)^{N - m_1}} \]

where $\binom{a}{b} = a!/(b!(a-b)!)$ is the binomial coefficient and $C = B_2 \setminus B_1$.

3. Let $P := (1/N) \sum_{j=1}^{N} \delta_{W_j}$ be the empirical measure of a sample $W_1, \ldots, W_N$ from a Dirichlet process with parameter measure $\alpha$. For any disjoint sets $A_1, \ldots, A_k$ in $\mathcal{B}$, $(NP_{A_1}, \ldots, NP_{A_k})$ has a Pólya distribution with $N$ trials and parameters $\alpha(A_1), \ldots, \alpha(A_k), \alpha((\cup_{i=1}^{k} A_i)^c)$; hence the distribution of $NP_{A_i}$ given $NP_{A_1}, \ldots, NP_{A_{i-1}}$ depends only on $\sum_{j=1}^{i-1} P_{A_j}$ (it is a Pólya distribution with $N(1 - \sum_{j=1}^{i-1} P_{A_j})$ trials and parameters $\alpha(A_i), \alpha((\cup_{j=1}^{i} A_j)^c)$). The process $P$ is Q-Markov with

\[ Q_{B_1B_2}\Big(\frac{m_1}{N}; \Big\{\frac{m_2}{N}\Big\}\Big) = \binom{N - m_1}{m_2 - m_1} \frac{\alpha(C)^{[m_2 - m_1]}\, \alpha(B_2^c)^{[N - m_2]}}{\alpha(B_1^c)^{[N - m_1]}} \]

where $a^{[x]} = a(a+1) \ldots (a + x - 1)$ and $C = B_2 \setminus B_1$.
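The transition probabilities of Examples 2 and 3 can be checked numerically against Definition 2.2.(a). The sketch below uses exact rational arithmetic; the function names and the particular masses are hypothetical choices made for illustration, and the formulas are read with $P_0(B_2^c)$, $P_0(B_1^c)$ (respectively $\alpha(B_2^c)$, $\alpha(B_1^c)$) in the last two factors.

```python
from fractions import Fraction
from math import comb

def rising(a, x):
    """Rising factorial a^{[x]} = a (a+1) ... (a+x-1), with a^{[0]} = 1."""
    out = Fraction(1)
    for i in range(x):
        out *= a + i
    return out

def q_binomial(N, m1, m2, p_c, p_b2c):
    """Example 2: mass that Q_{B1B2}(m1/N; .) puts on m2/N.

    p_c = P0(C) with C = B2 minus B1, p_b2c = P0(B2^c); P0(B1^c) = p_c + p_b2c.
    """
    if not 0 <= m1 <= m2 <= N:
        return Fraction(0)
    return (comb(N - m1, m2 - m1) * p_c ** (m2 - m1) * p_b2c ** (N - m2)
            / (p_c + p_b2c) ** (N - m1))

def q_polya(N, m1, m2, a_c, a_b2c):
    """Example 3: same structure, with rising factorials of alpha(C), alpha(B2^c)."""
    if not 0 <= m1 <= m2 <= N:
        return Fraction(0)
    return (comb(N - m1, m2 - m1) * rising(a_c, m2 - m1) * rising(a_b2c, N - m2)
            / rising(a_c + a_b2c, N - m1))

def chapman_kolmogorov_gap(q, N, m1, m3, w12, w23, w3c):
    """Q_{B1B3}(m1; m3) minus the sum over m2 of Q_{B1B2}(m1; m2) Q_{B2B3}(m2; m3).

    w12, w23, w3c are the masses of B2 minus B1, B3 minus B2 and B3^c.
    """
    direct = q(N, m1, m3, w12 + w23, w3c)
    composed = sum(q(N, m1, m2, w12, w23 + w3c) * q(N, m2, m3, w23, w3c)
                   for m2 in range(m1, m3 + 1))
    return direct - composed

# Hypothetical masses for three nested sets B1, B2, B3 and N = 6 sample points.
w = (Fraction(1, 5), Fraction(3, 10), Fraction(1, 4))
gaps = [chapman_kolmogorov_gap(q, 6, m1, m3, *w)
        for q in (q_binomial, q_polya)
        for m1 in range(7) for m3 in range(m1, 7)]
row_sum = sum(q_binomial(6, 2, m2, w[0], w[1] + w[2]) for m2 in range(2, 7))
```

Every entry of `gaps` vanishes exactly, which is the transition-system identity of Definition 2.2.(a) restricted to these two families.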

3 The posterior distribution of a Q-Markov random probability measure

We begin by introducing the Bayesian nonparametric framework.

Let $P := (P_A)_{A \in \mathcal{B}}$ be a Q-Markov random probability measure defined on a probability space $(\Omega, \mathcal{F}, \mathcal{P})$ and $X_i : \Omega \to \mathcal{X}, i = 1, \ldots, n$ some $\mathcal{F}/\mathcal{B}$-measurable functions such that $\forall A_1, \ldots, A_n \in \mathcal{B}$

\[ \mathcal{P}[X_1 \in A_1, \ldots, X_n \in A_n \mid P] = \prod_{i=1}^{n} P_{A_i} \quad \text{a.s.} \]

We say that $X := (X_1, \ldots, X_n)$ is a sample from $P$. The distribution of $P$ is called prior, while the distribution of $P$ given $X$ is called posterior. Note that $(P_A)_{A \in \mathcal{B}}$ and $X_1, \ldots, X_n$ can be constructed as coordinate-variables on the space $([0,1]^{\mathcal{B}} \times \mathcal{X}^n, \mathcal{B}([0,1])^{\mathcal{B}} \times \mathcal{B}^n)$ under the probability measure $\mathcal{P}$ defined by

\[ \mathcal{P}\Big(D \times \prod_{i=1}^{n} A_i\Big) := \int_D \prod_{i=1}^{n} \omega_{A_i}\, \mathcal{P}^1(d\omega) \]

where $\mathcal{P}^1$ is the probability measure given by Theorem 2.3.

The goal of this section is to prove that the posterior distribution of $P$ given $X = x$ is $Q^{(x)}$-Markov (for some "posterior" transition system $Q^{(x)}$).

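Let $\alpha_n$ be the law of $X$ under $\mathcal{P}$ and $\mu_{A_1, \ldots, A_k}$ be the law of $(P_{A_1}, \ldots, P_{A_k})$ under $\mathcal{P}$, for every $A_1, \ldots, A_k \in \mathcal{B}$. Note that $\alpha_n(\prod_{i=1}^{n} A_i) = \mathcal{E}[\prod_{i=1}^{n} P_{A_i}]$, where $\mathcal{E}$ denotes the expectation with respect to $\mathcal{P}$.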
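As an illustration of the identity $\alpha_n(\prod_i A_i) = \mathcal{E}[\prod_i P_{A_i}]$, suppose (as an assumption made only for this example) that $P$ is a Dirichlet process with parameter measure $\alpha$, so that $P_A$ has a Beta distribution with parameters $a = \alpha(A)$ and $b = \alpha(A^c)$. The sketch below computes $\alpha_n(A^k \times (A^c)^{n-k})$ in two ways: from the mixed Beta moment, and from the sequential (Pólya urn) predictive rule. The helper names are hypothetical.

```python
from fractions import Fraction

def rising(a, x):
    """Rising factorial a^{[x]} = a (a+1) ... (a+x-1)."""
    out = Fraction(1)
    for i in range(x):
        out *= a + i
    return out

def alpha_n_moment(a, b, k, n):
    """E[P_A^k (1 - P_A)^(n-k)] for P_A ~ Beta(a, b)."""
    return rising(a, k) * rising(b, n - k) / rising(a + b, n)

def alpha_n_urn(a, b, k, n):
    """Probability that the first k draws fall in A and the remaining n - k
    fall in A^c, multiplying one-step predictive probabilities."""
    a, b = Fraction(a), Fraction(b)
    prob, in_a, in_ac = Fraction(1), 0, 0
    for i in range(n):
        total = a + b + i
        if i < k:
            prob *= (a + in_a) / total
            in_a += 1
        else:
            prob *= (b + in_ac) / total
            in_ac += 1
    return prob

a, b = Fraction(2), Fraction(3)   # hypothetical values of alpha(A), alpha(A^c)
pairs = [(alpha_n_moment(a, b, k, 5), alpha_n_urn(a, b, k, 5)) for k in range(6)]
```

For $n = 1$ and $k = 1$ both functions return $\alpha(A)/\alpha(\mathcal{X})$.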

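For each set $B_1 \in \mathcal{A}(u)$, let $\nu_{B_1}$ be the law of $(X_1, \ldots, X_n, P_{B_1})$ under $\mathcal{P}$. Note that $\nu_{B_1}(\prod_{i=1}^{n} A_i \times \Gamma_1) = \mathcal{E}[\prod_{i=1}^{n} P_{A_i} \cdot I_{\Gamma_1}(P_{B_1})]$ and

\[ \nu_{B_1}(\tilde{A} \times \Gamma_1) = \int_{\tilde{A}} \mu^{(x)}_{B_1}(\Gamma_1)\, \alpha_n(dx) = \int_{\Gamma_1} Q_{B_1}(z_1; \tilde{A})\, \mu_{B_1}(dz_1) \tag{1} \]

where $\mu^{(x)}_{B_1}(\Gamma_1) := \mathcal{P}[P_{B_1} \in \Gamma_1 \mid X = x]$ and $Q_{B_1}(z_1; \tilde{A}) := \mathcal{P}[X \in \tilde{A} \mid P_{B_1} = z_1]$.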
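The two disintegrations in (1) can be made concrete in a toy discrete setting. The sketch below is not the general construction: it assumes that $P_{B_1}$ takes finitely many values, and it coarsens the data to the membership pattern of the observations in $B_1$, so that for $\tilde{A} = B_1^k \times (B_1^c)^{n-k}$ one has $Q_{B_1}(z; \tilde{A}) = z^k (1 - z)^{n-k}$. The function name and the prior values are hypothetical.

```python
from fractions import Fraction

def joint_two_ways(prior, k, n, gamma):
    """Both sides of (1) for A~ = B1^k x (B1^c)^(n-k), data coarsened to membership in B1.

    prior: dict z -> mu_{B1}({z}); gamma: a set of z values playing the role of Gamma_1.
    """
    lik = {z: z ** k * (1 - z) ** (n - k) for z in prior}          # Q_{B1}(z; A~)
    marginal = sum(prior[z] * lik[z] for z in prior)               # alpha_n(A~)
    posterior = {z: prior[z] * lik[z] / marginal for z in prior}   # Bayes' rule
    lhs = sum(posterior[z] for z in gamma) * marginal              # posterior mass times alpha_n(A~)
    rhs = sum(lik[z] * prior[z] for z in gamma)                    # Q_{B1} integrated against mu_{B1}
    return lhs, rhs, posterior

prior = {Fraction(1, 4): Fraction(1, 2), Fraction(1, 2): Fraction(1, 4), Fraction(3, 4): Fraction(1, 4)}
lhs, rhs, posterior = joint_two_ways(prior, k=2, n=3, gamma={Fraction(1, 4), Fraction(1, 2)})
```

Here `lhs` uses the first integral in (1) and `rhs` the second; they agree because both compute the joint probability of the data pattern and the event $\{P_{B_1} \in \Gamma_1\}$.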

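For each sets $B_1, B_2 \in \mathcal{A}(u);\ B_1 \subseteq B_2$, let $\nu_{B_1B_2}$ be the law of $(X_1, \ldots, X_n, P_{B_1}, P_{B_2})$ under $\mathcal{P}$. Note that $\nu_{B_1B_2}(\prod_{i=1}^{n} A_i \times \Gamma_1 \times \Gamma_2) = \mathcal{E}[\prod_{i=1}^{n} P_{A_i} \cdot I_{\Gamma_1}(P_{B_1})\, I_{\Gamma_2}(P_{B_2})]$ and

\[ \nu_{B_1B_2}(\tilde{A} \times \Gamma_1 \times \Gamma_2) = \int_{\tilde{A}} \int_{\Gamma_1} Q^{(x)}_{B_1B_2}(z_1; \Gamma_2)\, \mu^{(x)}_{B_1}(dz_1)\, \alpha_n(dx) \tag{2} \]

\[ = \int_{\Gamma_1 \times \Gamma_2} Q_{B_1B_2}(z_1, z_2; \tilde{A})\, \mu_{B_1B_2}(dz_1 \times dz_2) \tag{3} \]

where

\[ Q^{(x)}_{B_1B_2}(z_1; \Gamma_2) := \mathcal{P}[P_{B_2} \in \Gamma_2 \mid X = x, P_{B_1} = z_1] \tag{4} \]

and $Q_{B_1B_2}(z_1, z_2; \tilde{A}) := \mathcal{P}[X \in \tilde{A} \mid P_{B_1} = z_1, P_{B_2} = z_2]$. (For the first equality we used the first integral in the decomposition (1) of $\nu_{B_1}$.)

Using the second integral in the decomposition (1) of $\nu_{B_1}$ and the Q-Markov property for representing $\mu_{B_1B_2}$ we get (for $\mu_{B_1}$-almost all $z_1$):

\[ \int_{\tilde{A}} Q^{(x)}_{B_1B_2}(z_1; \Gamma_2)\, Q_{B_1}(z_1; dx) = \int_{\Gamma_2} Q_{B_1B_2}(z_1, z_2; \tilde{A})\, Q_{B_1B_2}(z_1; dz_2). \tag{5} \]

This very important equation is the key for determining the posterior transition probabilities $Q^{(x)}_{B_1B_2}$ from the prior transition probabilities $Q_{B_1B_2}$, provided that $Q_{B_1}(z_1; \prod_{i=1}^{n} A_i) = \mathcal{E}[\prod_{i=1}^{n} P_{A_i} \mid P_{B_1} = z_1]$ and $Q_{B_1B_2}(z_1, z_2; \prod_{i=1}^{n} A_i) = \mathcal{E}[\prod_{i=1}^{n} P_{A_i} \mid P_{B_1} = z_1, P_{B_2} = z_2]$ are easily computable.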

We note that each $Q^{(x)}_{B_1B_2}(z_1; \cdot)$ is well-defined only for $\nu_{B_1}$-almost all $(x, z_1)$. Moreover, as we will see in the proof of Theorem 3.4 and as was correctly pointed out by an anonymous referee, $Q^{(x)}$ may not be a genuine transition system as introduced by Definition 2.2.(a). To avoid any confusion we introduce the following terminology.

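Definition 3.1 The family $Q^{(x)} := (Q^{(x)}_{B_1B_2})_{B_1 \subseteq B_2}$ defined by (4) is called a posterior transition system (corresponding to $P$ and $X$) if $\forall B_1 \subseteq B_2 \subseteq B_3$ in $\mathcal{A}(u)$, $\forall \Gamma_3 \in \mathcal{B}([0,1])$ and for $\nu_{B_1}$-almost all $(x, z_1)$

\[ Q^{(x)}_{B_1B_3}(z_1; \Gamma_3) = \int_{[0,1]} Q^{(x)}_{B_2B_3}(z_2; \Gamma_3)\, Q^{(x)}_{B_1B_2}(z_1; dz_2). \]

In this case, we will say that the conditional distribution of $P$ given $X = x$ is $Q^{(x)}$-Markov if $\forall B_1 \subseteq B_2$ in $\mathcal{A}(u)$, $\forall \Gamma_2 \in \mathcal{B}([0,1])$

\[ \mathcal{P}[P_{B_2} \in \Gamma_2 \mid \mathcal{F}_{B_1}, X = x] = Q^{(x)}_{B_1B_2}(P_{B_1}; \Gamma_2) \quad \text{a.s.} \]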

We now proceed to the proof of the main theorem. Two preliminary lemmas are needed.

Let $B_1 \subseteq B_2$ be some arbitrary sets in $\mathcal{A}(u)$, $C := B_2 \setminus B_1$ and $0 \le l \le r \le n$. The next lemma describes the probability that the first $l$ observations fall in $B_1$, the next $r - l$ observations fall in $C$ and the remaining $n - r$ observations fall in $B_2^c$, given $P_{B_1}$ and $P_{B_2}$.

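Lemma 3.2 For each $B_1 \subseteq B_2$ in $\mathcal{A}(u)$ and $A_1, \ldots, A_n \in \mathcal{B}$, let

\[ \tilde{A} := \prod_{i=1}^{l} (A_i \cap B_1) \times \prod_{i=l+1}^{r} (A_i \cap C) \times \prod_{i=r+1}^{n} (A_i \cap B_2^c) \tag{6} \]

where $C := B_2 \setminus B_1$ and $0 \le l \le r \le n$. Let $\tilde{A}_1 := \prod_{i=1}^{l} (A_i \cap B_1) \times \mathcal{X}^{n-l}$, and let $\tilde{A}_2$, $\tilde{A}_3$ be the analogous cylinder sets which constrain only the coordinates $l+1, \ldots, r$ (to $A_i \cap C$), respectively $r+1, \ldots, n$ (to $A_i \cap B_2^c$); let $\tilde{A}_{23} := \tilde{A}_2 \cap \tilde{A}_3$.

(a) For $\mu_{B_1}$-almost all $z_1$, $Q_{B_1}(z_1; \tilde{A}) = Q_{B_1}(z_1; \tilde{A}_1) \cdot Q_{B_1}(z_1; \tilde{A}_{23})$.

(b) For $\mu_{B_1B_2}$-almost all $(z_1, z_2)$,

\[ Q_{B_1B_2}(z_1, z_2; \tilde{A}) = Q_{B_1}(z_1; \tilde{A}_1) \cdot Q_{B_1B_2}(z_1, z_2; \tilde{A}_2) \cdot Q_{B_2}(z_2; \tilde{A}_3). \]

Proof: We will prove only (b), since part (a) follows by a similar argument. Note that the sets $\tilde{A}$ form a $\pi$-system generating the $\sigma$-field $\mathcal{B}^n$ on $B_1^l \times C^{r-l} \times (B_2^c)^{n-r}$. Since $\sigma(\mathcal{A}) = \mathcal{B}$ and $\mathcal{A}$ is a $\pi$-system, using a Dynkin system argument, it is enough to consider the case $A_1, \ldots, A_n \in \mathcal{A}$. Note that

\[ \mathcal{E}\Big[\prod_{i=r+1}^{n} P_{A_i \cap B_2^c} \,\Big|\, \mathcal{F}_{B_2}\Big] = \mathcal{E}\Big[\prod_{i=r+1}^{n} P_{A_i \cap B_2^c} \,\Big|\, P_{B_2}\Big] = Q_{B_2}(P_{B_2}; \tilde{A}_3). \]

By double conditioning with respect to $\mathcal{F}_{B_2}$, we have

\[ Q_{B_1B_2}(z_1, z_2; \tilde{A}) = \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap B_1} \prod_{i=l+1}^{r} P_{A_i \cap C} \prod_{i=r+1}^{n} P_{A_i \cap B_2^c} \,\Big|\, P_{B_1} = z_1, P_{B_2} = z_2\Big] \]

\[ = Q_{B_2}(z_2; \tilde{A}_3) \cdot \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap B_1} \prod_{i=l+1}^{r} P_{A_i \cap C} \,\Big|\, P_{B_1} = z_1, P_{B_2} = z_2\Big]. \]

For the second term we have

\[ \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap B_1} \prod_{i=l+1}^{r} P_{A_i \cap C} \,\Big|\, P_{B_1}, P_{B_2}\Big] = \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap B_1}\, \mathcal{E}\Big[\prod_{i=l+1}^{r} P_{A_i \cap C} \,\Big|\, (P_{A_i \cap B_1})_{i \le l}, P_{B_1}, P_{B_2}\Big] \,\Big|\, P_{B_1}, P_{B_2}\Big]. \]

Since $P_{A_i \cap C} = P_{B_1 \cup (A_i \cap B_2)} - P_{B_1}$, using Lemma A.1 (Appendix A)

\[ \mathcal{E}\Big[\prod_{i=l+1}^{r} P_{A_i \cap C} \,\Big|\, (P_{A_i \cap B_1})_{i \le l}, P_{B_1}, P_{B_2}\Big] = Q_{B_1B_2}(P_{B_1}, P_{B_2}; \tilde{A}_2). \]

(In order to use Lemma A.1, we need $A_{l+1} \subseteq A_{l+2} \subseteq \ldots \subseteq A_r$. Note that this is not a restriction, since we can consider the minimal semilattice $\{A'_1, \ldots, A'_m\}$ determined by the sets $A_{l+1}, \ldots, A_r$, which is ordered such that $A'_j \not\subseteq \cup_{i<j} A'_i, \forall j$; if we let $B'_j = \cup_{i \le j} A'_i$ and $C'_j = B'_j \setminus B'_{j-1}$, then each $A_i = \cup_{j \in J_i} C'_j$ for some $J_i \subseteq \{1, \ldots, m\}$. We have $A_i \cap C = \cup_{j \in J_i} [(B'_j \cap C) \setminus (B'_{j-1} \cap C)]$ and $\prod_{i=l+1}^{r} P_{A_i \cap C} = h(P_{B'_1 \cap C}, \ldots, P_{B'_m \cap C})$ for some function $h$.)

Finally, since $\mathcal{F}_{B_1}$ is conditionally independent of $P_{B_2}$ given $P_{B_1}$ and $P_{A_i \cap B_1}, i \le l$ are $\mathcal{F}_{B_1}$-measurable, we have $\mathcal{E}[\prod_{i=1}^{l} P_{A_i \cap B_1} \mid P_{B_1}, P_{B_2}] = \mathcal{E}[\prod_{i=1}^{l} P_{A_i \cap B_1} \mid P_{B_1}] = Q_{B_1}(P_{B_1}; \tilde{A}_1)$, which concludes the proof. □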

Note: Let $\tilde{A}_{12} := \tilde{A}_1 \cap \tilde{A}_2$. By a similar argument one can show that

\[ Q_{B_1B_2}(z_1, z_2; \tilde{A}_{12}) = Q_{B_1}(z_1; \tilde{A}_1) \cdot Q_{B_1B_2}(z_1, z_2; \tilde{A}_2) \tag{7} \]

\[ Q_{B_1B_2}(z_1, z_2; \tilde{A}_{23}) = Q_{B_1B_2}(z_1, z_2; \tilde{A}_2) \cdot Q_{B_2}(z_2; \tilde{A}_3) \tag{8} \]



The next lemma tells us that if $B_1 \subseteq B_2$ are "nicely-shaped" regions and we want to predict the value of $P_{B_2}$ given the value of $P_{B_1}$ and a sample $X$ from $P$, then we can forget all about those values $X_i$ which fall inside the region $B_1$. The reason for this phenomenon is the very essence of the Markov property given by Definition 2.2.(b), which says that for predicting the value of $P_{B_2}$ it suffices to know the value of $P_{B_1}$, i.e. all the information about the values of $P$ inside the region $B_1$ can be discarded.

Lemma 3.3 For every $B_1, B_2 \in \mathcal{A}(u)$ with $B_1 \subseteq B_2$, for every $\Gamma_2 \in \mathcal{B}([0,1])$ and for $\nu_{B_1}$-almost all $(x, z_1)$, $Q^{(x)}_{B_1B_2}(z_1; \Gamma_2)$ does not depend on those $x_i$'s that fall in $B_1$; in particular, for $\nu_{B_1}$-almost all $(x, z_1)$ in $B_1^n \times [0,1]$, $Q^{(x)}_{B_1B_2}(z_1; \Gamma_2) = Q_{B_1B_2}(z_1; \Gamma_2)$.

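Proof: Let $A_1, \ldots, A_n \in \mathcal{B}$ and $\tilde{A}$ be defined by (6). Using (5) and Lemma 3.2.(b) combined with (8) we have

\[ \int_{\tilde{A}} Q^{(x)}_{B_1B_2}(z_1; \Gamma_2)\, Q_{B_1}(z_1; dx) = Q_{B_1}(z_1; \tilde{A}_1) \int_{\Gamma_2} Q_{B_1B_2}(z_1, z_2; \tilde{A}_{23})\, Q_{B_1B_2}(z_1; dz_2) = Q_{B_1}(z_1; \tilde{A}_1) \int_{\tilde{A}_{23}} Q^{(x)}_{B_1B_2}(z_1; \Gamma_2)\, Q_{B_1}(z_1; dx). \]

The result follows by Lemma A.2 (Appendix A), since on the set $B_1^l \times (B_1^c)^{n-l}$, $Q_{B_1}(z_1; \cdot)$ is the product measure between its marginal with respect to the first $l$ components restricted to $B_1^l$ and its marginal with respect to the remaining $n - l$ components restricted to $(B_1^c)^{n-l}$ (by Lemma 3.2.(a)).

□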

Here is the main result of the paper. 

Theorem 3.4 If $P := (P_A)_{A \in \mathcal{B}}$ is a Q-Markov random probability measure and $X := (X_1, \ldots, X_n)$ is a sample from $P$, then the family $Q^{(x)} = (Q^{(x)}_{B_1B_2})_{B_1 \subseteq B_2}$ defined by (4) is a posterior transition system and the conditional distribution of $P$ given $X = x$ is $Q^{(x)}$-Markov.



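Proof: By Proposition 5 of Balan and Ivanoff (2002), it is enough to show that $\forall B_1 \subseteq B_2 \subseteq \ldots \subseteq B_k$ in $\mathcal{A}(u)$, $\forall \tilde{\Gamma} \in \mathcal{B}([0,1])^k$ and for $\alpha_n$-almost all $x$

\[ \mathcal{P}[(P_{B_1}, \ldots, P_{B_k}) \in \tilde{\Gamma} \mid X = x] = \int_{\tilde{\Gamma}} Q^{(x)}_{B_{k-1}B_k}(z_{k-1}; dz_k) \ldots Q^{(x)}_{B_1B_2}(z_1; dz_2)\, \mu^{(x)}_{B_1}(dz_1) \]

or equivalently, for every $\tilde{A} \in \mathcal{B}^n$

\[ \mathcal{P}[X \in \tilde{A}, (P_{B_1}, \ldots, P_{B_k}) \in \tilde{\Gamma}] = \int_{\tilde{A}} \int_{\tilde{\Gamma}} Q^{(x)}_{B_{k-1}B_k}(z_{k-1}; dz_k) \ldots Q^{(x)}_{B_1B_2}(z_1; dz_2)\, \mu^{(x)}_{B_1}(dz_1)\, \alpha_n(dx). \tag{9} \]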

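Note also that (9) will imply that $Q^{(x)}$ is a posterior transition system.

For the proof of (9) we will use an induction argument on $k \ge 2$. The statement for $k = 2$ is exactly (2). Assume that the statement is true for $k - 1$. For each $B_1 \subseteq B_2 \subseteq \ldots \subseteq B_k$ in $\mathcal{A}(u)$ we let $\nu_{B_1 \ldots B_k}$ be the law of $(X_1, \ldots, X_n, P_{B_1}, \ldots, P_{B_k})$ under $\mathcal{P}$. Note that $\forall A_1, \ldots, A_n \in \mathcal{B}$, $\forall \Gamma_1, \ldots, \Gamma_k \in \mathcal{B}([0,1])$, $\nu_{B_1 \ldots B_k}(\prod_{i=1}^{n} A_i \times \prod_{j=1}^{k} \Gamma_j) = \mathcal{E}[\prod_{i=1}^{n} P_{A_i} \cdot \prod_{j=1}^{k} I_{\Gamma_j}(P_{B_j})]$. On the other hand, $\nu_{B_1 \ldots B_k}(\tilde{A} \times \prod_{j=1}^{k} \Gamma_j)$ is also equal to

\[ \int_{\tilde{A} \times \prod_{j=1}^{k-1} \Gamma_j} Q^{(x)}_{B_1 \ldots B_k}(z_1, \ldots, z_{k-1}; \Gamma_k)\, \nu_{B_1 \ldots B_{k-1}}(dx \times dz_1 \times \ldots \times dz_{k-1}) = \tag{10} \]

\[ \int_{\prod_{j=1}^{k} \Gamma_j} Q_{B_1 \ldots B_k}(z_1, \ldots, z_k; \tilde{A})\, \mu_{B_1 \ldots B_k}(dz_1 \times \ldots \times dz_k) \]

where $Q^{(x)}_{B_1 \ldots B_k}(z_1, \ldots, z_{k-1}; \Gamma_k) := \mathcal{P}[P_{B_k} \in \Gamma_k \mid X = x, P_{B_j} = z_j, j < k]$ and $Q_{B_1 \ldots B_k}(z_1, \ldots, z_k; \prod_{i=1}^{n} A_i) := \mathcal{P}[X_1 \in A_1, \ldots, X_n \in A_n \mid P_{B_j} = z_j, j \le k] = \mathcal{E}[\prod_{i=1}^{n} P_{A_i} \mid P_{B_j} = z_j, j \le k]$.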

Using the induction hypothesis, the measure $\nu_{B_1 \ldots B_{k-1}}$ disintegrates as

\[ Q^{(x)}_{B_{k-2}B_{k-1}}(z_{k-2}; dz_{k-1}) \ldots Q^{(x)}_{B_1B_2}(z_1; dz_2)\, \mu^{(x)}_{B_1}(dz_1)\, \alpha_n(dx). \]

Therefore, it is enough to prove that for every $\Gamma_k \in \mathcal{B}([0,1])$ and for $\nu_{B_1 \ldots B_{k-1}}$-almost all $(x, z_1, \ldots, z_{k-1})$

\[ Q^{(x)}_{B_1 \ldots B_k}(z_1, \ldots, z_{k-1}; \Gamma_k) = Q^{(x)}_{B_{k-1}B_k}(z_{k-1}; \Gamma_k). \tag{11} \]

On the other hand, the measure $\nu_{B_1 \ldots B_{k-1}}$ also disintegrates as

\[ Q_{B_1 \ldots B_{k-1}}(z_1, \ldots, z_{k-1}; dx)\, \mu_{B_1 \ldots B_{k-1}}(dz_1 \times \ldots \times dz_{k-1}) \]

with respect to its marginal $\mu_{B_1 \ldots B_{k-1}}$ with respect to the last $k - 1$ components. By the Q-Markov property, the measure $\mu_{B_1 \ldots B_k}$ disintegrates as

\[ Q_{B_{k-1}B_k}(z_{k-1}; dz_k)\, \mu_{B_1 \ldots B_{k-1}}(dz_1 \times \ldots \times dz_{k-1}). \]

Using (10) we can conclude that for $\mu_{B_1 \ldots B_{k-1}}$-almost all $(z_1, \ldots, z_{k-1})$

\[ \int_{\tilde{A}} Q^{(x)}_{B_1 \ldots B_k}(z_1, \ldots, z_{k-1}; \Gamma_k)\, Q_{B_1 \ldots B_{k-1}}(z_1, \ldots, z_{k-1}; dx) = \int_{\Gamma_k} Q_{B_1 \ldots B_k}(z_1, \ldots, z_k; \tilde{A})\, Q_{B_{k-1}B_k}(z_{k-1}; dz_k). \tag{12} \]

Let $C_1 := B_1;\ C_j = B_j \setminus B_{j-1}, j = 2, \ldots, k;\ C_{k+1} = B_k^c$. Note that each $C_j \in \mathcal{C}(u)$ and $(C_1, \ldots, C_{k+1})$ is a partition of $\mathcal{X}$; hence each point $x_i$ falls into exactly one set of this partition.

We proceed to the proof of (11) and we will suppose that for some $0 \le l \le r \le n$, the points $x_1, \ldots, x_l$ fall into $B_{k-1}$ (more precisely, each $x_i$ falls into some $C_{j_i}$ with $1 \le j_1 \le \ldots \le j_l \le k - 1$), the points $x_{l+1}, \ldots, x_r$ fall into $C_k$ and the points $x_{r+1}, \ldots, x_n$ fall into $C_{k+1}$.
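The bookkeeping behind this arrangement of the sample can be sketched as follows. To keep the example concrete it assumes (this is an illustrative choice, not the general setting) that $\mathcal{X} = [0, \infty)$ and $B_j = [0, b_j]$ with $b_1 < \ldots < b_k$; the helper names are hypothetical.

```python
from bisect import bisect_left

def cell_index(x, bounds):
    """Index j in 1..k+1 of the cell C_j containing x >= 0, for B_j = [0, bounds[j-1]].

    C_1 = B_1, C_j = B_j minus B_{j-1} for 2 <= j <= k, and C_{k+1} is the complement of B_k.
    """
    return bisect_left(bounds, x) + 1

def split_sample(xs, bounds):
    """Sort the sample by cell index and return (sorted sample, l, r).

    l counts the points in B_{k-1}; r - l counts the points in C_k;
    the remaining n - r points lie in C_{k+1}.
    """
    k = len(bounds)
    ordered = sorted(xs, key=lambda x: cell_index(x, bounds))
    l = sum(1 for x in xs if cell_index(x, bounds) <= k - 1)
    r = l + sum(1 for x in xs if cell_index(x, bounds) == k)
    return ordered, l, r

bounds = [1.0, 2.0, 4.0]          # B_1 = [0,1], B_2 = [0,2], B_3 = [0,4]
xs = [5.0, 0.5, 3.0, 1.0, 1.5]
ordered, l, r = split_sample(xs, bounds)
```

After sorting, the first `l` points lie in $B_{k-1}$ with nondecreasing cell indices, which is the configuration assumed in the proof.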

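The main tool will be (12), where we will consider a set $\tilde{A}$ of the form

\[ \tilde{A} := \prod_{i=1}^{l} (A_i \cap C_{j_i}) \times \prod_{i=l+1}^{r} (A_i \cap C_k) \times \prod_{i=r+1}^{n} (A_i \cap C_{k+1}), \quad A_i \in \mathcal{B}. \]

Let $\tilde{A}_2 := \mathcal{X}^{l} \times \prod_{i=l+1}^{r} (A_i \cap C_k) \times \mathcal{X}^{n-r}$, $\tilde{A}_3 := \mathcal{X}^{r} \times \prod_{i=r+1}^{n} (A_i \cap C_{k+1})$ and $\tilde{A}_{23} := \tilde{A}_2 \cap \tilde{A}_3$. We will prove that

\[ Q_{B_1 \ldots B_k}(z_1, \ldots, z_k; \tilde{A}) = M \cdot Q_{B_{k-1}B_k}(z_{k-1}, z_k; \tilde{A}_{23}) \tag{13} \]

\[ Q_{B_1 \ldots B_{k-1}}(z_1, \ldots, z_{k-1}; \tilde{A}) = M \cdot Q_{B_{k-1}}(z_{k-1}; \tilde{A}_{23}) \tag{14} \]

where $M := \prod_{i=1}^{l} Q_{B_{j_i - 1}B_{j_i}}(z_{j_i - 1}, z_{j_i}; (A_i \cap C_{j_i}) \times \mathcal{X}^{n-1})$. Then we will have

\[ \int_{\tilde{A}} Q^{(x)}_{B_1 \ldots B_k}(z_1, \ldots, z_{k-1}; \Gamma_k)\, Q_{B_1 \ldots B_{k-1}}(z_1, \ldots, z_{k-1}; dx) = M \cdot \int_{\Gamma_k} Q_{B_{k-1}B_k}(z_{k-1}, z_k; \tilde{A}_{23})\, Q_{B_{k-1}B_k}(z_{k-1}; dz_k) \]

\[ = M \cdot \int_{\tilde{A}_{23}} Q^{(x)}_{B_{k-1}B_k}(z_{k-1}; \Gamma_k)\, Q_{B_{k-1}}(z_{k-1}; dx) = \int_{\tilde{A}} Q^{(x)}_{B_{k-1}B_k}(z_{k-1}; \Gamma_k)\, Q_{B_1 \ldots B_{k-1}}(z_1, \ldots, z_{k-1}; dx) \]

where we used (12) and (13) for the first equality, (5) for the second equality and (14) for the third equality (taking into account that $Q^{(x)}_{B_{k-1}B_k}(z_{k-1}; \Gamma_k)$ does not depend on $x_1, \ldots, x_l$). Relation (11) will follow immediately.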

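It remains to prove (13) and (14). Using Lemma 3 of Balan and Ivanoff (2002) we have (for $A_i \in \mathcal{A}$):

\[ \mathcal{E}\Big[\prod_{i=r+1}^{n} P_{A_i \cap C_{k+1}} \,\Big|\, \mathcal{F}_{B_k}\Big] = \mathcal{E}\Big[\prod_{i=r+1}^{n} P_{A_i \cap C_{k+1}} \,\Big|\, P_{B_k}\Big] = Q_{B_k}(P_{B_k}; \tilde{A}_3) \]

and therefore, by double conditioning with respect to $\mathcal{F}_{B_k}$,

\[ Q_{B_1 \ldots B_k}((z_j)_{j \le k}; \tilde{A}) = \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap C_{j_i}} \prod_{i=l+1}^{r} P_{A_i \cap C_k} \prod_{i=r+1}^{n} P_{A_i \cap C_{k+1}} \,\Big|\, P_{B_j} = z_j, j \le k\Big] \]

\[ = Q_{B_k}(z_k; \tilde{A}_3) \cdot \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap C_{j_i}} \prod_{i=l+1}^{r} P_{A_i \cap C_k} \,\Big|\, P_{B_j} = z_j, j \le k\Big]. \]

For the second term we have

\[ \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap C_{j_i}} \prod_{i=l+1}^{r} P_{A_i \cap C_k} \,\Big|\, P_{B_j}, j \le k\Big] = \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap C_{j_i}}\, \mathcal{E}\Big[\prod_{i=l+1}^{r} P_{A_i \cap C_k} \,\Big|\, P_{B_{j_i - 1} \cup (A_i \cap B_{j_i})}, i \le l;\ P_{B_j}, j \le k\Big] \,\Big|\, P_{B_j}, j \le k\Big]. \]

Since $P_{A_i \cap C_k} = P_{B_{k-1} \cup (A_i \cap B_k)} - P_{B_{k-1}}$, using Lemma A.1 (Appendix A)

\[ \mathcal{E}\Big[\prod_{i=l+1}^{r} P_{A_i \cap C_k} \,\Big|\, P_{B_{j_i - 1} \cup (A_i \cap B_{j_i})}, i \le l;\ P_{B_j}, j \le k\Big] = Q_{B_{k-1}B_k}(P_{B_{k-1}}, P_{B_k}; \tilde{A}_2). \]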

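(In order to use Lemma A.1 we need $A_{l+1} \subseteq A_{l+2} \subseteq \ldots \subseteq A_r$, but this is not a restriction, as we have seen in the proof of Lemma 3.2.)

Note that by (8), $Q_{B_{k-1}B_k}(z_{k-1}, z_k; \tilde{A}_2) \cdot Q_{B_k}(z_k; \tilde{A}_3) = Q_{B_{k-1}B_k}(z_{k-1}, z_k; \tilde{A}_{23})$. Hence the proof of (13) will be complete once we show that

\[ \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap C_{j_i}} \,\Big|\, P_{B_j} = z_j, j \le k\Big] = \prod_{i=1}^{l} Q_{B_{j_i - 1}B_{j_i}}(z_{j_i - 1}, z_{j_i}; (A_i \cap C_{j_i}) \times \mathcal{X}^{n-1}). \tag{15} \]

But this follows by induction, using Lemma A.1 (Appendix A).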

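We turn now to the proof of (14). Using Lemma 3 of Balan and Ivanoff (2002) we have:

\[ \mathcal{E}\Big[\prod_{i=l+1}^{r} P_{A_i \cap C_k} \prod_{i=r+1}^{n} P_{A_i \cap C_{k+1}} \,\Big|\, \mathcal{F}_{B_{k-1}}\Big] = Q_{B_{k-1}}(P_{B_{k-1}}; \tilde{A}_{23}) \]

and therefore, by double conditioning with respect to $\mathcal{F}_{B_{k-1}}$, we obtain the following expression for $Q_{B_1 \ldots B_{k-1}}(z_1, \ldots, z_{k-1}; \tilde{A})$:

\[ \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap C_{j_i}} \prod_{i=l+1}^{r} P_{A_i \cap C_k} \prod_{i=r+1}^{n} P_{A_i \cap C_{k+1}} \,\Big|\, P_{B_j} = z_j, j \le k-1\Big] = Q_{B_{k-1}}(z_{k-1}; \tilde{A}_{23}) \cdot \mathcal{E}\Big[\prod_{i=1}^{l} P_{A_i \cap C_{j_i}} \,\Big|\, P_{B_j} = z_j, j \le k-1\Big] \]

and (14) follows, using (15). The proof of the theorem is complete. □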

The posterior distribution of a Dirichlet process is also Dirichlet. In the case of an empirical measure which corresponds to a sample either from a non-random distribution or from a Dirichlet process, the calculations for the posterior transition probabilities $Q^{(x)}_{B_1B_2}$ are not straightforward for samples of size greater than 1; however, in the case of a sample of size 1 we have the following result.

Proposition 3.5 If $P := (P_A)_{A \in \mathcal{B}}$ is the empirical measure of a sample of size $N$ from a non-random distribution $P_0$ (respectively from a Dirichlet process with parameter measure $\alpha$) and $X$ is a sample of size 1 from $P$, then the conditional distribution of $P$ given $X = x$ is $Q^{(x)}$-Markov with

\[ Q^{(x)}_{B_1B_2}\Big(\frac{m_1}{N}; \Big\{\frac{m_2}{N}\Big\}\Big) = Q^{(1)}_{B_1B_2}\Big(\frac{m_1 - \delta_x(B_1)}{N - 1}; \Big\{\frac{m_2 - \delta_x(B_2)}{N - 1}\Big\}\Big) \]

where $Q^{(1)}$ is the transition system of the empirical measure of a sample of size $N - 1$ from $P_0$ (respectively from a Dirichlet process with parameter measure $\alpha$).

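Proof: Let $P$ be the empirical measure of a sample from a non-random distribution $P_0$. Note that $\alpha_1(A) = \mathcal{E}[P_A] = P_0(A), \forall A \in \mathcal{B}$. We have

\[ Q^{(x)}_{B_1B_2}\Big(\frac{m_1}{N}; \Big\{\frac{m_2}{N}\Big\}\Big) = Q^{(1)}_{B_1B_2}\Big(\frac{m_1 - 1}{N - 1}; \Big\{\frac{m_2 - 1}{N - 1}\Big\}\Big) \]

for $\alpha_1$-almost all $x \in B_1$. The fact that

\[ Q^{(x)}_{B_1B_2}\Big(\frac{m_1}{N}; \Big\{\frac{m_2}{N}\Big\}\Big) = Q^{(1)}_{B_1B_2}\Big(\frac{m_1}{N - 1}; \Big\{\frac{m_2 - 1}{N - 1}\Big\}\Big) \]

for $\alpha_1$-almost all $x \in C$ follows from (5), since for every $A \in \mathcal{B}$

\[ Q_{B_1}\Big(\frac{m_1}{N}; A \cap C\Big) = \mathcal{E}\Big[P_{A \cap C} \,\Big|\, P_{B_1} = \frac{m_1}{N}\Big] = \frac{N - m_1}{N} \cdot \frac{P_0(A \cap C)}{P_0(B_1^c)} \]

\[ Q_{B_1B_2}\Big(\frac{m_1}{N}, \frac{m_2}{N}; A \cap C\Big) = \mathcal{E}\Big[P_{A \cap C} \,\Big|\, P_{B_1} = \frac{m_1}{N}, P_{B_2} = \frac{m_2}{N}\Big] = \frac{m_2 - m_1}{N} \cdot \frac{P_0(A \cap C)}{P_0(C)}. \]

Similarly one can show that

\[ Q_{B_1}\Big(\frac{m_1}{N}; A \cap B_2^c\Big) = \frac{N - m_1}{N} \cdot \frac{P_0(A \cap B_2^c)}{P_0(B_1^c)}, \qquad Q_{B_1B_2}\Big(\frac{m_1}{N}, \frac{m_2}{N}; A \cap B_2^c\Big) = \frac{N - m_2}{N} \cdot \frac{P_0(A \cap B_2^c)}{P_0(B_2^c)} \]

and hence

\[ Q^{(x)}_{B_1B_2}\Big(\frac{m_1}{N}; \Big\{\frac{m_2}{N}\Big\}\Big) = Q^{(1)}_{B_1B_2}\Big(\frac{m_1}{N - 1}; \Big\{\frac{m_2}{N - 1}\Big\}\Big) \]

for $\alpha_1$-almost all $x$ in $B_2^c$.

If $P$ is the empirical measure of a sample from a Dirichlet process with parameter measure $\alpha$, then $\alpha_1(A) = \alpha(A)/\alpha(\mathcal{X})$ and a similar argument can be used. □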
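The statement of Proposition 3.5 can be verified with exact arithmetic in the case of a non-random $P_0$. The sketch below computes the posterior transition probability from the key equation (5), using the conditional expectations written in the proof, and compares it with the shifted prior transition probability of a sample of size $N - 1$. All function names and the masses of $C$ and $B_2^c$ are hypothetical; the check only covers the binomial case of Example 2.

```python
from fractions import Fraction
from math import comb

def q_binomial(N, m1, m2, p_c, p_b2c):
    """Prior Q_{B1B2}(m1/N; {m2/N}) of Example 2; p_c = P0(C), p_b2c = P0(B2^c)."""
    if not 0 <= m1 <= m2 <= N:
        return Fraction(0)
    return (comb(N - m1, m2 - m1) * p_c ** (m2 - m1) * p_b2c ** (N - m2)
            / (p_c + p_b2c) ** (N - m1))

def posterior_q(N, m1, m2, p_c, p_b2c, where):
    """Q^{(x)}_{B1B2}(m1/N; {m2/N}) for one observation x, derived from (5).

    where is 'B1', 'C' or 'B2c': the cell of the partition containing x.
    """
    prior = q_binomial(N, m1, m2, p_c, p_b2c)
    if where == "B1":
        return prior                          # Lemma 3.3: a point in B1 changes nothing
    if where == "C":
        num = Fraction(m2 - m1, N) / p_c      # density of Q_{B1B2}(z1, z2; dx) w.r.t. P0 on C
    else:
        num = Fraction(N - m2, N) / p_b2c     # same on B2^c
    den = Fraction(N - m1, N) / (p_c + p_b2c) # density of Q_{B1}(z1; dx) w.r.t. P0 on B1^c
    return num / den * prior

def shifted_q(N, m1, m2, p_c, p_b2c, where):
    """Right-hand side of Proposition 3.5: N - 1 trials and counts shifted by delta_x."""
    d1 = 1 if where == "B1" else 0            # delta_x(B1)
    d2 = 1 if where in ("B1", "C") else 0     # delta_x(B2)
    return q_binomial(N - 1, m1 - d1, m2 - d2, p_c, p_b2c)

N, p_c, p_b2c = 5, Fraction(3, 10), Fraction(1, 2)
checks = []
for m1 in range(N + 1):
    for m2 in range(m1, N + 1):
        # Only cells that can actually contain an observation are compared.
        feasible = [("B1", m1 >= 1), ("C", m2 > m1), ("B2c", m2 < N)]
        for where, ok in feasible:
            if ok:
                checks.append(posterior_q(N, m1, m2, p_c, p_b2c, where)
                              == shifted_q(N, m1, m2, p_c, p_b2c, where))
```

The feasibility filter reflects that, given $NP_{B_1} = m_1$ and $NP_{B_2} = m_2$, the observation can lie in a cell only if that cell contains at least one sample point.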

4 Neutral to the right random probability measures

Let $P := (P_A)_{A \in \mathcal{B}}$ be a random probability measure on $\mathcal{X}$. For every sets $B_1, B_2 \in \mathcal{A}(u)$ with $B_1 \subseteq B_2$, we define $V_{B_1B_2}$ to be equal to $(P_{B_2} - P_{B_1})/(1 - P_{B_1})$ on the set $\{P_{B_1} < 1\}$ and 1 elsewhere; let $F_{B_1B_2}$ be the distribution of $V_{B_1B_2}$. The next definition generalizes the definition of Doksum (1974).

Definition 4.1 A random probability measure $P := (P_A)_{A \in \mathcal{B}}$ is called neutral to the right if for every sets $B_1 \subseteq \ldots \subseteq B_k$ in $\mathcal{A}(u)$, $P_{B_1}, V_{B_1B_2}, \ldots, V_{B_{k-1}B_k}$ are independent.

Comments: 1. A random probability measure $P := (P_A)_{A \in \mathcal{B}}$ is neutral to the right if and only if $\forall B_1, B_2 \in \mathcal{A}(u), B_1 \subseteq B_2$, $V_{B_1B_2}$ is independent of $\mathcal{F}_{B_1}$.

2. The Dirichlet process with parameter measure $\alpha$ is neutral to the right with $F_{B_1B_2}$ equal to the Beta distribution with parameters $\alpha(B_2 \setminus B_1), \alpha(B_2^c)$.

3. If we denote $C_1 = B_1;\ C_i = B_i \setminus B_{i-1};\ i = 2, \ldots, k$, then $(P_{C_1}, \ldots, P_{C_k})$ has a "completely neutral" distribution (see Definition B.2); this distribution was formally introduced by Connor and Mosimann (1969), although the concept itself goes back to Halmos (1944). Note that the Dirichlet process is the only non-trivial process which has completely neutral distributions over any disjoint sets $\{A_1, \ldots, A_k\}$ in $\mathcal{B}$ (according to Ferguson 1974, p. 622).

4. In general, the process $Y_A := -\ln(1 - P_A), A \in \mathcal{B}$ is not additive and hence it does not have independent increments, even if $Y_{B_1}, Y_{B_2} - Y_{B_1}, \ldots, Y_{B_k} - Y_{B_{k-1}}$ are independent for any sets $B_1 \subseteq B_2 \subseteq \ldots \subseteq B_k$ in $\mathcal{A}(u)$ (the increment $Y_{B_2 \setminus B_1}$ is not equal to $Y_{B_2} - Y_{B_1}$); therefore, the theory of processes with independent increments cannot be used in higher dimensions.

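Proposition 4.2 A neutral to the right random probability measure is Q-Markov with

\[ Q_{B_1B_2}(z_1; \Gamma_2) = F_{B_1B_2}\Big(\frac{\Gamma_2 - z_1}{1 - z_1}\Big) \ \text{for } z_1 < 1, \qquad Q_{B_1B_2}(1; \cdot) = \delta_1. \tag{16} \]

Proof: For any sets $B_1 \subseteq \ldots \subseteq B_k$ in $\mathcal{A}(u)$, $P_{B_1}, \ldots, P_{B_k}$ is a Markov chain:

\[ \mathcal{P}[P_{B_j} \in \Gamma_j \mid P_{B_1} = z_1, \ldots, P_{B_{j-1}} = z_{j-1}] = \mathcal{P}\Big[V_{B_{j-1}B_j} \in \frac{\Gamma_j - z_{j-1}}{1 - z_{j-1}} \,\Big|\, P_{B_1} = z_1, V_{B_1B_2} = v_2, \ldots, V_{B_{j-2}B_{j-1}} = v_{j-1}\Big] \]

\[ = \mathcal{P}\Big[V_{B_{j-1}B_j} \in \frac{\Gamma_j - z_{j-1}}{1 - z_{j-1}}\Big] = \mathcal{P}\Big[V_{B_{j-1}B_j} \in \frac{\Gamma_j - z_{j-1}}{1 - z_{j-1}} \,\Big|\, P_{B_{j-1}} = z_{j-1}\Big] = \mathcal{P}[P_{B_j} \in \Gamma_j \mid P_{B_{j-1}} = z_{j-1}] \]

where $v_i := (z_i - z_{i-1})/(1 - z_{i-1}), i = 2, \ldots, j - 1$ and assuming $z_i < 1, \forall i$. □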

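For any sets $B_1 \subseteq B_2 \subseteq B_3$ in $\mathcal{A}(u)$, $V_{B_1B_3} = V_{B_1B_2} + V_{B_2B_3} - V_{B_1B_2} V_{B_2B_3}$. This leads us to the following definition.

Definition 4.3 For each $B_1, B_2 \in \mathcal{A}(u)$ with $B_1 \subseteq B_2$, let $F_{B_1B_2}$ be a probability measure on $[0,1]$. The family $(F_{B_1B_2})_{B_1 \subseteq B_2}$ is called a neutral to the right system if $\forall B_1 \subseteq B_2 \subseteq B_3$ in $\mathcal{A}(u)$, $\forall \Gamma \in \mathcal{B}([0,1])$

\[ F_{B_1B_3}(\Gamma) = \int_{[0,1]^2} I_{\Gamma}(y + z - yz)\, F_{B_2B_3}(dz)\, F_{B_1B_2}(dy). \]

Comments: 1. If we let $U_{B_1B_2} := -\ln(1 - V_{B_1B_2})$ and $G_{B_1B_2}$ be the distribution of $U_{B_1B_2}$, then for every $B_1 \subseteq B_2 \subseteq B_3$ in $\mathcal{A}(u)$, $U_{B_1B_3} = U_{B_1B_2} + U_{B_2B_3}$ and $G_{B_1B_3} = G_{B_1B_2} * G_{B_2B_3}$.

2. Let $Q_{B_1B_2}(z_1; \Gamma_2) := F_{B_1B_2}((\Gamma_2 - z_1)/(1 - z_1))$ for $z_1 < 1$ and $Q_{B_1B_2}(1; \cdot) = \delta_1$; then $(F_{B_1B_2})_{B_1 \subseteq B_2}$ is a neutral to the right system if and only if $(Q_{B_1B_2})_{B_1 \subseteq B_2}$ is a transition system.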
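The composition rule behind Definition 4.3 can be illustrated numerically. The sketch below checks the pathwise identity $1 - V_{B_1B_3} = (1 - V_{B_1B_2})(1 - V_{B_2B_3})$, the additivity of $U$, and, for the Dirichlet process, that the moments of $1 - V$ multiply as the neutral to the right system property requires. It assumes that $V_{B_1B_2}$ has a Beta distribution with parameters $\alpha(B_2 \setminus B_1)$ and $\alpha(B_2^c)$; the function names and the numerical masses are hypothetical.

```python
import math
from fractions import Fraction

def compose(v12, v23):
    """V_{B1B3} from V_{B1B2} and V_{B2B3}: 1 - V13 = (1 - V12)(1 - V23)."""
    return v12 + v23 - v12 * v23

def rising(a, x):
    """Rising factorial a^{[x]} = a (a+1) ... (a+x-1)."""
    out = Fraction(1)
    for i in range(x):
        out *= a + i
    return out

def beta_survival_moment(a, b, k):
    """E[(1 - V)^k] for V ~ Beta(a, b)."""
    return rising(b, k) / rising(a + b, k)

v13 = compose(Fraction(1, 2), Fraction(1, 3))
u_sum = -math.log(1 - 0.5) - math.log(1 - 1 / 3)

# Hypothetical masses alpha(B2 minus B1), alpha(B3 minus B2), alpha(B3^c).
a12, a23, a3c = Fraction(2), Fraction(1, 2), Fraction(3)
moment_pairs = [(beta_survival_moment(a12 + a23, a3c, k),
                 beta_survival_moment(a12, a23 + a3c, k) * beta_survival_moment(a23, a3c, k))
                for k in range(1, 8)]
```

Since a distribution on $[0,1]$ is determined by its moments, agreement of these moments for all $k$ is equivalent to the integral identity of Definition 4.3 for the Beta family.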

The following result is the converse of Proposition 4.2. 

Proposition 4.4 If $P := (P_A)_{A \in \mathcal{B}}$ is a Q-Markov random probability measure with a transition system $Q$ given by (16) for a neutral to the right system $(F_{B_1B_2})_{B_1 \subseteq B_2}$, then $P$ is neutral to the right.

Proof: We want to prove that for every $B_1, B_2 \in \mathcal{A}(u)$ with $B_1 \subseteq B_2$ and for every $A_1, \ldots, A_k \in \mathcal{A}, A_i \subseteq B_1, A_k = B_1$, $V_{B_1B_2}$ is independent of $(P_{A_1}, \ldots, P_{A_k})$. Using the Q-Markov property we have:

\[ \mathcal{P}[V_{B_1B_2} \in \Gamma \mid P_{A_i} = z_i;\ i = 1, \ldots, k] = \mathcal{P}[P_{B_2} \in z_k + (1 - z_k)\Gamma \mid P_{B_1} = z_k] = Q_{B_1B_2}(z_k; z_k + (1 - z_k)\Gamma) = F_{B_1B_2}(\Gamma) = \mathcal{P}(V_{B_1B_2} \in \Gamma). \]

Since this holds for any Borel set $\Gamma$ in $[0,1]$, the proof is complete.

□

In what follows we will prove that the posterior distribution of a neutral to the right random probability measure is also neutral to the right, by showing that the posterior transition probabilities $Q^{(x)}_{B_1B_2}$ are of the form (16) for a "posterior" neutral to the right system $(F^{(x)}_{B_1B_2})_{B_1 \subseteq B_2}$. This extends Doksum's (1974) result to an arbitrary space $\mathcal{X}$ which can be endowed with an indexing collection $\mathcal{A}$.

Let $P := (P_A)_{A \in \mathcal{B}}$ be a neutral to the right process and $X := (X_1, \ldots, X_n)$ a sample from $P$. In order to define the probability measures $F^{(x)}_{B_1B_2}$ we use the same Bayesian technique as in Section 3.

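For each sets $B_1, B_2 \in \mathcal{A}(u);\ B_1 \subseteq B_2$, let $\phi_{B_1B_2}$ be the law of $(X_1, \ldots, X_n, V_{B_1B_2})$ under $\mathcal{P}$. Note that $\phi_{B_1B_2}(\prod_{i=1}^{n} A_i \times \Gamma) = \mathcal{E}[\prod_{i=1}^{n} P_{A_i} \cdot I_{\Gamma}(V_{B_1B_2})]$. On the other hand, we have

\[ \phi_{B_1B_2}(\tilde{A} \times \Gamma) = \int_{\tilde{A}} F^{(x)}_{B_1B_2}(\Gamma)\, \alpha_n(dx) = \int_{\Gamma} T_{B_1B_2}(z; \tilde{A})\, F_{B_1B_2}(dz) \tag{17} \]

where

\[ F^{(x)}_{B_1B_2}(\Gamma) := \mathcal{P}[V_{B_1B_2} \in \Gamma \mid X = x] \tag{18} \]

and $T_{B_1B_2}(z; \tilde{A}) := \mathcal{P}[X \in \tilde{A} \mid V_{B_1B_2} = z]$.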

In the proof of Theorem 4.8 we will see that $(F^{(x)}_{B_1B_2})_{B_1 \subseteq B_2}$ may not be a genuine neutral to the right system as introduced by Definition 4.3. Therefore we need to introduce the following terminology.

Definition 4.5 The family $(F^{(x)}_{B_1B_2})_{B_1 \subseteq B_2}$ defined by (18) is called a posterior neutral to the right system (corresponding to $P$ and $X$) if $\forall B_1 \subseteq B_2 \subseteq B_3$ in $\mathcal{A}(u)$, $\forall \Gamma \in \mathcal{B}([0,1])$ and for $\alpha_n$-almost all $x$

\[ F^{(x)}_{B_1B_3}(\Gamma) = \int_{[0,1]^2} I_{\Gamma}(y + z - yz)\, F^{(x)}_{B_2B_3}(dz)\, F^{(x)}_{B_1B_2}(dy). \]

The conditional distribution of $P$ given $X = x$ is called neutral to the right if $\forall B_1 \subseteq B_2$ in $\mathcal{A}(u)$, $V_{B_1B_2}$ is conditionally independent of $\mathcal{F}_{B_1}$ given $X$.

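Let $C := B_2 \setminus B_1$. For fixed $0 \le l \le r \le n$ we will consider sets of the form $\tilde{A}_{23} := \mathcal{X}^{l} \times \prod_{i=l+1}^{r} (A_i \cap C) \times \prod_{i=r+1}^{n} (A_i \cap B_2^c)$, where $A_i \in \mathcal{B}$.

Lemma 4.6 (a) For $\mu_{B_1}$-almost all $z_1$,

\[ Q_{B_1}(z_1; \tilde{A}_{23}) = \frac{(1 - z_1)^{n-l}}{\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^{l})} \cdot \alpha_n(\tilde{A}_{23}). \]

(b) For $\mu_{B_1B_2}$-almost all $(z_1, z_2)$,

\[ Q_{B_1B_2}(z_1, z_2; \tilde{A}_{23}) = \frac{(1 - z_1)^{n-l}}{\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^{l})} \cdot T_{B_1B_2}\Big(\frac{z_2 - z_1}{1 - z_1}; \tilde{A}_{23}\Big). \]

Proof: Without loss of generality we will assume that $A_i \in \mathcal{A}, \forall i$. We have

\[ \prod_{i=l+1}^{r} P_{A_i \cap C} \cdot \prod_{i=r+1}^{n} P_{A_i \cap B_2^c} = (1 - P_{B_1})^{n-l} \prod_{i=l+1}^{r} \frac{P_{A_i \cap C}}{1 - P_{B_1}} \prod_{i=r+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_1}}. \tag{19} \]

Note that $P_{A_i \cap C}/(1 - P_{B_1}) = V_{B_1, (A_i \cap B_2) \cup B_1}$, $P_{A_i \cap B_2^c}/(1 - P_{B_1}) = V_{B_1, A_i \cup B_2} - V_{B_1B_2}$, and $P_{B_1}$ is independent of $V_{B_1, (A_i \cap B_2) \cup B_1}, i = l+1, \ldots, r$, $V_{B_1B_2}$ and $V_{B_1, A_i \cup B_2}, i = r+1, \ldots, n$.

(a) Take $\mathcal{E}[\,\cdot\,]$, respectively $\mathcal{E}[\,\cdot \mid P_{B_1} = z_1]$ in (19); we get

\[ \mathcal{E}\Big[\prod_{i=l+1}^{r} \frac{P_{A_i \cap C}}{1 - P_{B_1}} \prod_{i=r+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_1}}\Big] = \frac{\alpha_n(\tilde{A}_{23})}{\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^{l})} \tag{20} \]

\[ Q_{B_1}(z_1; \tilde{A}_{23}) = (1 - z_1)^{n-l}\, \mathcal{E}\Big[\prod_{i=l+1}^{r} \frac{P_{A_i \cap C}}{1 - P_{B_1}} \prod_{i=r+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_1}}\Big]. \]

(b) Take $\mathcal{E}[\,\cdot \mid V_{B_1B_2} = z]$, respectively $\mathcal{E}[\,\cdot \mid P_{B_1} = z_1, P_{B_2} = z_2]$ in (19); we get

\[ \mathcal{E}\Big[\prod_{i=l+1}^{r} \frac{P_{A_i \cap C}}{1 - P_{B_1}} \prod_{i=r+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_1}} \,\Big|\, V_{B_1B_2} = z\Big] = \frac{T_{B_1B_2}(z; \tilde{A}_{23})}{\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^{l})} \tag{21} \]

\[ Q_{B_1B_2}(z_1, z_2; \tilde{A}_{23}) = (1 - z_1)^{n-l}\, \mathcal{E}\Big[\prod_{i=l+1}^{r} \frac{P_{A_i \cap C}}{1 - P_{B_1}} \prod_{i=r+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_1}} \,\Big|\, V_{B_1B_2} = \frac{z_2 - z_1}{1 - z_1}\Big] \]

which concludes the proof. □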

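Lemma 4.7 For every $B_1, B_2 \in \mathcal{A}(u)$ with $B_1 \subseteq B_2$, for every $\Gamma \in \mathcal{B}([0,1])$ and for $\alpha_n$-almost all $x$, $F^{(x)}_{B_1B_2}(\Gamma)$ does not depend on those $x_i$'s that fall in $B_1$; in particular, for $\alpha_n$-almost all $x$ in $B_1^n$, $F^{(x)}_{B_1B_2}(\Gamma) = F_{B_1B_2}(\Gamma)$.

Proof: For arbitrary $A_1, \ldots, A_n \in \mathcal{A}$ and $\tilde{A}$ defined by (6) we write

\[ \prod_{i=1}^{l} P_{A_i \cap B_1} \prod_{i=l+1}^{r} P_{A_i \cap C} \prod_{i=r+1}^{n} P_{A_i \cap B_2^c} = (1 - P_{B_1})^{n-l} \prod_{i=1}^{l} P_{A_i \cap B_1} \prod_{i=l+1}^{r} \frac{P_{A_i \cap C}}{1 - P_{B_1}} \prod_{i=r+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_1}}. \]

Taking $\mathcal{E}[\,\cdot\,]$, $\mathcal{E}[\,\cdot \mid V_{B_1B_2} = z]$ and using (20), respectively (21), we get

\[ \alpha_n(\tilde{A}) = \frac{\alpha_n(\prod_{i=1}^{l} (A_i \cap B_1) \times (B_1^c)^{n-l})}{\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^{l})} \cdot \alpha_n(\tilde{A}_{23}) \]

\[ T_{B_1B_2}(z; \tilde{A}) = \frac{\alpha_n(\prod_{i=1}^{l} (A_i \cap B_1) \times (B_1^c)^{n-l})}{\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^{l})} \cdot T_{B_1B_2}(z; \tilde{A}_{23}). \]

Using (17) we get

\[ \int_{\tilde{A}} F^{(x)}_{B_1B_2}(\Gamma)\, \alpha_n(dx) = \int_{\Gamma} T_{B_1B_2}(z; \tilde{A})\, F_{B_1B_2}(dz) = \frac{\alpha_n(\prod_{i=1}^{l} (A_i \cap B_1) \times (B_1^c)^{n-l})}{\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^{l})} \int_{\Gamma} T_{B_1B_2}(z; \tilde{A}_{23})\, F_{B_1B_2}(dz) \]

\[ = \frac{\alpha_n(\prod_{i=1}^{l} (A_i \cap B_1) \times (B_1^c)^{n-l})}{\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^{l})} \int_{\tilde{A}_{23}} F^{(x)}_{B_1B_2}(\Gamma)\, \alpha_n(dx). \]

The result follows by Lemma A.2 (Appendix A). □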
Here is the main result of this section. 

Theorem 4.8 If $P := (P_A)_{A \in \mathcal{B}}$ is a neutral to the right random probability
measure and $X := (X_1, \ldots, X_n)$ is a sample from $P$, then the conditional
distribution of $P$ given $X = x$ is also neutral to the right.

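Proof: Since $P$ is $Q$-Markov, by Theorem 3.4 the conditional distribution of $P$
given $X = x$ is $Q^{(x)}$-Markov. Using Lemma 4.6, the key equation (5) becomes

$$\int_{\tilde{A}_{23}} Q^{(x)}_{B_1B_2}(z_1; \Gamma_2)\,\alpha_n(dx) = \int_{\Gamma_2} \tilde{T}_{B_1B_2}\left(\frac{z_2 - z_1}{1 - z_1}; \tilde{A}_{23}\right) Q_{B_1B_2}(z_1; dz_2).$$

Using Proposition 4.2 and relation (17), the right-hand side becomes (for $z_1 < 1$)

$$\int_{(\Gamma_2 - z_1)/(1 - z_1)} \tilde{T}_{B_1B_2}(z; \tilde{A}_{23})\,F_{B_1B_2}(dz) = \int_{\tilde{A}_{23}} F^{(x)}_{B_1B_2}\left(\frac{\Gamma_2 - z_1}{1 - z_1}\right) \alpha_n(dx).$$

This proves that $\forall z_1 \in [0, 1)$, $\forall \Gamma_2 \in \mathcal{B}([0, 1])$ and for $\alpha_n$-almost all $x$

$$Q^{(x)}_{B_1B_2}(z_1; \Gamma_2) = F^{(x)}_{B_1B_2}\left(\frac{\Gamma_2 - z_1}{1 - z_1}\right).$$

Since $Q^{(x)}$ is a posterior transition system, it follows that $(F^{(x)}_{B_1B_2})_{B_1 \subseteq B_2}$ is a
posterior neutral to the right system. By Proposition 4.4, the distribution of $P$
given $X$ is neutral to the right. □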
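
The rescaled set $(\Gamma_2 - z_1)/(1 - z_1)$ used in this proof comes from a change of variable. The sketch below assumes that $V_{B_1B_2} := (P_{B_2} - P_{B_1})/(1 - P_{B_1})$ has law $F_{B_1B_2}$ and is independent of $P_{B_1}$, which is what we take Proposition 4.2 to express; that proposition is not reproduced in this section, so the exact statement is an assumption.

```latex
% Assumptions: V := V_{B_1 B_2} = (P_{B_2} - P_{B_1}) / (1 - P_{B_1}),
% V has law F_{B_1 B_2} and is independent of P_{B_1}; z_1 < 1.
\begin{align*}
P_{B_2} &= P_{B_1} + (1 - P_{B_1})\, V, \\
Q_{B_1B_2}(z_1; \Gamma_2)
  &= \mathcal{P}[\, z_1 + (1 - z_1) V \in \Gamma_2 \,]
   = F_{B_1B_2}\!\left(\frac{\Gamma_2 - z_1}{1 - z_1}\right), \\
\frac{\Gamma_2 - z_1}{1 - z_1}
  &:= \left\{ \frac{z_2 - z_1}{1 - z_1} \, ; \, z_2 \in \Gamma_2 \right\}.
\end{align*}
```

Substituting $z = (z_2 - z_1)/(1 - z_1)$ in an integral against $Q_{B_1B_2}(z_1; dz_2)$ therefore turns it into an integral against $F_{B_1B_2}(dz)$ over the rescaled set.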

The next result gives some simple formulas for calculating the posterior
distribution of $P_{B_1}$ when all the observations fall outside $B_1$, and the posterior
distribution of $V_{B_1B_2}$ when all the observations fall outside $B_2 \setminus B_1$.

Proposition 4.9 (a) For $\alpha_n$-almost all $x$ with $X_i \in B_1^c$ $\forall i$
(b) For $\alpha_n$-almost all $x$ with $X_i \in (B_2 \setminus B_1)^c$ $\forall i$
where $m$ denotes the number of $X_i$'s that fall outside $B_2$.

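Proof: Note that (a) is a particular case of (b) since $\mu^{(x)}_{B_1} = F^{(x)}_{\emptyset B_1}$. We proceed to
the proof of (b). For fixed $0 \le l \le n$, let $\tilde{A} := \prod_{i=1}^{l}(A_i \cap B_1) \times \prod_{i=l+1}^{n}(A_i \cap B_2^c)$,
where $A_i \in \mathcal{B}$. We claim that

$$\tilde{T}_{B_1B_2}(z; \tilde{A}) = (1 - z)^{n-l} \cdot \frac{\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^l)}{\alpha_n((B_2^c)^{n-l} \times \mathcal{X}^l)} \cdot \alpha_n(\tilde{A}). \qquad (22)$$

Using (17), it follows that

and hence for $\alpha_n$-almost all $x$ with $X_i \in (B_2 \setminus B_1)^c$, $\forall i$

$$F^{(x)}_{B_1B_2}(\Gamma) = \frac{\mathcal{E}[(1 - P_{B_1})^{n-l}]}{\mathcal{E}[(1 - P_{B_2})^{n-l}]} \cdot \mathcal{E}[I_\Gamma(V_{B_1B_2})(1 - V_{B_1B_2})^{n-l}] = \frac{\mathcal{E}[I_\Gamma(V_{B_1B_2})(1 - P_{B_2})^{n-l}]}{\mathcal{E}[(1 - P_{B_2})^{n-l}]}$$

since $P_{B_1}$ is independent of $V_{B_1B_2}$.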

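We turn now to the proof of (22). Without loss of generality we will assume
that $A_i \in \mathcal{A}$, $\forall i$. Let $\tilde{A}_2 = \prod_{i=l+1}^{n}(A_i \cap B_2^c) \times \mathcal{X}^l$. We have

$$\prod_{i=1}^{l} P_{A_i \cap B_1} \prod_{i=l+1}^{n} P_{A_i \cap B_2^c} = (1 - P_{B_1})^{n-l} \prod_{i=1}^{l} P_{A_i \cap B_1} \cdot (1 - V_{B_1B_2})^{n-l} \prod_{i=l+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_2}}.$$

Note that $(1 - P_{B_1})^{n-l} \prod_{i=1}^{l} P_{A_i \cap B_1}$ is $\mathcal{F}_{B_1}$-measurable and is independent
of $V_{B_1B_2}$, $V_{B_2,A_i \cup B_2}$, $i = l+1, \ldots, n$. By taking $\mathcal{E}[\,\cdot \mid V_{B_1B_2} = z]$ we get

$$\tilde{T}_{B_1B_2}(z; \tilde{A}) = (1 - z)^{n-l} \cdot \mathcal{E}\left[\prod_{i=1}^{l} P_{A_i \cap B_1} \cdot (1 - P_{B_1})^{n-l}\right] \cdot \mathcal{E}\left[\prod_{i=l+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_2}}\right]$$

$$= (1 - z)^{n-l} \cdot \alpha_n\left(\prod_{i=1}^{l}(A_i \cap B_1) \times (B_1^c)^{n-l}\right) \cdot \mathcal{E}\left[\prod_{i=l+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_2}}\right].$$

Finally, by taking expectation in

$$\prod_{i=1}^{l} P_{A_i \cap B_1} \cdot \prod_{i=l+1}^{n} P_{A_i \cap B_2^c} = \prod_{i=1}^{l} P_{A_i \cap B_1} \cdot (1 - P_{B_1})^{n-l} \cdot \prod_{i=l+1}^{n} \frac{P_{A_i \cap B_2^c}}{1 - P_{B_1}}$$

we get $\alpha_n(\tilde{A}) = \alpha_n\left(\prod_{i=1}^{l}(A_i \cap B_1) \times (B_1^c)^{n-l}\right) \cdot \alpha_n(\tilde{A}_2)/\alpha_n((B_1^c)^{n-l} \times \mathcal{X}^l)$. □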
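
The mechanism behind Proposition 4.9(b) and Lemma 4.7 can be checked numerically on a toy case. The sketch below is only an illustration under stated assumptions: $P_{B_1}$ and $V_{B_1B_2}$ are independent with finitely supported laws, $V_{B_1B_2} = (P_{B_2} - P_{B_1})/(1 - P_{B_1})$, and the sample has $l$ observations in $B_1$ and $m$ observations outside $B_2$. The function names `posterior_of_v` and `tilted_prior` and the two example laws are hypothetical and do not come from the paper.

```python
from fractions import Fraction as Fr

def posterior_of_v(law_p1, law_v, l, m):
    """Exact posterior law of V given l observations in B1 and m outside B2.

    Assumed likelihood: P_B1**l * (1 - P_B2)**m, with
    1 - P_B2 = (1 - P_B1) * (1 - V) and P_B1 independent of V.
    """
    weights = {}
    for v, pv in law_v.items():
        w = Fr(0)
        for p1, pp in law_p1.items():
            w += pp * pv * p1**l * ((1 - p1) * (1 - v))**m
        weights[v] = w
    total = sum(weights.values())
    return {v: w / total for v, w in weights.items()}

def tilted_prior(law_v, m):
    """Prior law of V reweighted by (1 - v)**m and renormalised."""
    total = sum(pv * (1 - v)**m for v, pv in law_v.items())
    return {v: pv * (1 - v)**m / total for v, pv in law_v.items()}

# Hypothetical finitely supported priors (values and probabilities are made up).
law_p1 = {Fr(1, 5): Fr(1, 3), Fr(1, 2): Fr(2, 3)}
law_v = {Fr(1, 4): Fr(1, 2), Fr(3, 4): Fr(1, 2)}
m = 2

# The posterior of V should not change with the number l of observations in B1.
same_for_all_l = all(
    posterior_of_v(law_p1, law_v, l, m) == tilted_prior(law_v, m)
    for l in range(4)
)
print(same_for_all_l)
```

Exact rational arithmetic is used so that the comparison is an equality rather than a floating-point approximation. In this toy setting the factor contributed by $P_{B_1}$ cancels in the normalisation, which is the cancellation that the independence of $P_{B_1}$ and $V_{B_1B_2}$ provides in the proof.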
A Some elementary results

Lemma A.1 If $(X_t)_t$ is a Markov process, then for every $s_1 < \ldots < s_n <
s < u_1 < \ldots < u_p < t < t_1 < \ldots < t_m$ and for every bounded measurable
function $h$

$$\mathcal{E}[h(X_{u_1}, \ldots, X_{u_p}) \mid X_{s_1}, \ldots, X_{s_n}, X_s, X_t, X_{t_1}, \ldots, X_{t_m}] = \mathcal{E}[h(X_{u_1}, \ldots, X_{u_p}) \mid X_s, X_t].$$

The proof of the previous lemma is elementary and will be omitted. 

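Lemma A.2 Let $(X, \mathcal{X}, \mu)$, $(Y, \mathcal{Y}, \nu)$ be probability spaces and $f : X \times Y \to \mathbb{R}$
a bounded measurable function. If $\forall A \in \mathcal{X}$, $\forall B \in \mathcal{Y}$

$$\int_{A \times B} f(x,y)\,(\mu \times \nu)(dx \times dy) = \mu(A) \int_{X \times B} f(x,y)\,(\mu \times \nu)(dx \times dy)$$

then $f(x,y)$ does not depend on $x$, for $(\mu \times \nu)$-almost all $(x,y)$.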

Proof: Let $f_B(x) = \int_B f(x,y)\,\nu(dy)$, $x \in X$ and $I_B = \int_X f_B(x)\,\mu(dx)$. We have

$$\int_A f_B(x)\,\mu(dx) = \mu(A) I_B = \int_A I_B\,\mu(dx), \quad \forall A \in \mathcal{X}$$

and hence $f_B(x) = I_B$, $\forall x \in N_0^c$, where $N_0$ is a $\mu$-negligible set. For each $x \in N_0^c$

$$\int_B f(x,y)\,\nu(dy) = \int_X \int_B f(x,y)\,\nu(dy)\,\mu(dx) = \int_B \int_X f(x,y)\,\mu(dx)\,\nu(dy), \quad \forall B \in \mathcal{Y}.$$

Hence $f(x,y) = \int_X f(x,y)\,\mu(dx) := g(y)$ for all $y \in N_x^c$, where $N_x$ is a $\nu$-negligible
set. If we take $N := \{(x,y); x \in N_0^c, y \in N_x^c\}^c$, then $(\mu \times \nu)(N) = 0$
and $f(x,y) = g(y)$, $\forall (x,y) \in N^c$. □



B A Bayes property of a Markov chain 

Lemma B.1 Let $(Z_1, \ldots, Z_k)$ be an increasing Markov chain with values in
$[0, 1]$, with initial distribution $\mu$ and transition probabilities $(Q_{i-1,i})_{i=2,\ldots,k}$; let
$Z_0 := 0$ and $Z_{k+1} := 1$. Let $Y_j = Z_j - Z_{j-1}$; $j = 1, \ldots, k+1$ and $X$ be a random
variable such that

$$\mathcal{P}[X = j \mid Y_1, \ldots, Y_{k+1}] = Y_j \quad \forall j = 1, \ldots, k+1.$$

Then for every $j = 1, \ldots, k+1$, the conditional distribution of $(Z_1, \ldots, Z_k)$
given $X = j$ coincides with the distribution of a Markov chain with some initial
distribution $\mu^{(j)}$ and some transition probabilities $(Q^{(j)}_{i-1,i})_{i=2,\ldots,k}$.

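Proof: Let $\alpha_j := \mathcal{P}(X = j) = \mathcal{E}[Y_j]$. We consider first the case $j > 1$. For any
sets $\Gamma_1, \ldots, \Gamma_k \in \mathcal{B}([0, 1])$ we have

$$\mathcal{P}[Z_1 \in \Gamma_1, \ldots, Z_k \in \Gamma_k \mid X = j] = \frac{1}{\alpha_j} \int_{\{Z_1 \in \Gamma_1, \ldots, Z_k \in \Gamma_k\}} \mathcal{P}[X = j \mid Z_1, \ldots, Z_k]\,d\mathcal{P}$$

$$= \frac{1}{\alpha_j} \int_{\Gamma_1} \cdots \int_{\Gamma_j} h(z_j)(z_j - z_{j-1})\,Q_{j-1,j}(z_{j-1}; dz_j) \ldots Q_{12}(z_1; dz_2)\,\mu(dz_1)$$

where $h(z_j) = \int_{\Gamma_{j+1}} \cdots \int_{\Gamma_k} Q_{k-1,k}(z_{k-1}; dz_k) \ldots Q_{j,j+1}(z_j; dz_{j+1})$. We denote
$\alpha^{(j)}_{j-1}(y) := \mathcal{E}[Y_j \mid Z_{j-1} = y]$ and $\alpha^{(j)}_{i-1}(y) := \mathcal{E}[\alpha^{(j)}_i(Z_i) \mid Z_{i-1} = y]$, $i < j$; we have

$$\mathcal{P}[Z_i \in \Gamma_i; i \le k \mid X = j] = \frac{1}{\alpha_j} \int_{\Gamma_1} \alpha^{(j)}_1(z_1) \cdot \frac{1}{\alpha^{(j)}_1(z_1)} \int_{\Gamma_2} \cdots\, \alpha^{(j)}_{j-1}(z_{j-1}) \cdot \frac{1}{\alpha^{(j)}_{j-1}(z_{j-1})} \int_{\Gamma_j} h(z_j)(z_j - z_{j-1})\,Q_{j-1,j}(z_{j-1}; dz_j) \ldots Q_{12}(z_1; dz_2)\,\mu(dz_1)$$

$$= \int_{\Gamma_1} \cdots \int_{\Gamma_k} Q^{(j)}_{k-1,k}(z_{k-1}; dz_k) \ldots Q^{(j)}_{12}(z_1; dz_2)\,\mu^{(j)}(dz_1)$$

where $\mu^{(j)}(\Gamma) := (1/\alpha_j) \int_\Gamma \alpha^{(j)}_1(y)\,\mu(dy)$ and

$$Q^{(j)}_{i-1,i}(y; \Gamma) := Q_{i-1,i}(y; \Gamma) \quad \text{if } i > j$$

$$Q^{(j)}_{j-1,j}(y; \Gamma) := \frac{1}{\alpha^{(j)}_{j-1}(y)} \int_\Gamma (z - y)\,Q_{j-1,j}(y; dz)$$

$$Q^{(j)}_{i-1,i}(y; \Gamma) := \frac{1}{\alpha^{(j)}_{i-1}(y)} \int_\Gamma \alpha^{(j)}_i(z)\,Q_{i-1,i}(y; dz) \quad \text{if } i < j.$$

We consider next the case $j = 1$. For any sets $\Gamma_1, \ldots, \Gamma_k \in \mathcal{B}([0, 1])$ we have

$$\mathcal{P}[Z_1 \in \Gamma_1, \ldots, Z_k \in \Gamma_k \mid X = 1] = \frac{1}{\alpha_1} \int_{\{Z_1 \in \Gamma_1, \ldots, Z_k \in \Gamma_k\}} \mathcal{P}[X = 1 \mid Z_1, \ldots, Z_k]\,d\mathcal{P}$$

$$= \int_{\Gamma_1} \cdots \int_{\Gamma_k} Q^{(1)}_{k-1,k}(z_{k-1}; dz_k) \ldots Q^{(1)}_{12}(z_1; dz_2)\,\mu^{(1)}(dz_1)$$

where $\mu^{(1)}(\Gamma) = (1/\alpha_1) \int_\Gamma y\,\mu(dy)$ and $Q^{(1)}_{i-1,i} = Q_{i-1,i}$, $\forall i = 2, \ldots, k$. □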

The following definition is taken from Fang, Kotz and Ng (1990), p. 163. 
Definition B.2 A random vector $(Y_1, \ldots, Y_k)$ with values in the simplex $S =
\{(y_j)_j; y_j \in [0, 1], \sum_{j=1}^{k} y_j \le 1\}$ has a completely neutral distribution if there
exist some independent random variables $V_1, \ldots, V_k$ such that $(Y_1, \ldots, Y_k)$ has
the same distribution as $(V_1, V_2(1 - V_1), \ldots, V_k \prod_{i=1}^{k-1}(1 - V_i))$.

The following result can be viewed as a complement to Theorem 4 of Asgharian
and Wolfson (2001).

Corollary B.3 If $(Y_1, \ldots, Y_k)$ has a completely neutral distribution, $Y_{k+1} :=
1 - \sum_{i=1}^{k} Y_i$ and $X$ is a random variable such that

$$\mathcal{P}[X = j \mid Y_1, \ldots, Y_{k+1}] = Y_j \quad \forall j = 1, \ldots, k+1$$

then the conditional distribution of $(Y_1, \ldots, Y_k)$ given $X = j$ is completely
neutral.

Proof: Let $Z_i = \sum_{j=1}^{i} Y_j$ and $V_1 = Y_1$, $V_i := Y_i/(1 - Z_{i-1})$, $i = 2, \ldots, k$. The
variables $V_1, \ldots, V_k$ are independent and $Z_1, \ldots, Z_k$ is a Markov chain with the
transition probabilities $Q_{i-1,i}(y; \Gamma) = F_i((\Gamma - y)/(1 - y))$, where $F_i$ is the
distribution of $V_i$. By Lemma B.1, the conditional distribution of $(Z_1, \ldots, Z_k)$
given $X = j$ coincides with the distribution of a Markov chain with some
transition probabilities $Q^{(j)}_{i-1,i}$. Direct calculations show that $Q^{(j)}_{i-1,i}(y; \Gamma) =
F^{(j)}_i((\Gamma - y)/(1 - y))$ with: $F^{(j)}_i = F_i$ if $i > j$,

$$F^{(j)}_j(\Gamma) = \frac{1}{\beta_j} \int_\Gamma v\,F_j(dv), \qquad F^{(j)}_i(\Gamma) = \frac{1}{1 - \beta_i} \int_\Gamma (1 - v)\,F_i(dv) \quad \text{if } i < j,$$

where $\beta_i = \mathcal{E}[V_i]$. The conclusion follows immediately. □
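
Corollary B.3 can be sanity-checked by exact enumeration on a small example. The sketch below assumes finitely supported laws for $V_1, \ldots, V_k$ (the specific values are made up) and uses the stick-breaking representation $Y_j = V_j \prod_{i<j}(1 - V_i)$, $Y_{k+1} = \prod_{i \le k}(1 - V_i)$. It compares the exact conditional law of $(V_1, \ldots, V_k)$ given $X = j$ with the product of reweighted marginals: weight $1 - v$ for $i < j$, weight $v$ for $i = j$, unchanged for $i > j$. The helper names `tilt`, `posterior_joint` and `predicted_joint` are hypothetical.

```python
from fractions import Fraction as Fr
from itertools import product

def tilt(law, weight):
    """Reweight a discrete law {value: prob} by weight(value) and renormalise."""
    total = sum(p * weight(v) for v, p in law.items())
    return {v: p * weight(v) / total for v, p in law.items()}

def posterior_joint(laws, j):
    """Exact law of (V_1..V_k) given X = j, where P[X = j | V] = Y_j."""
    k = len(laws)
    joint = {}
    for vs in product(*(law.keys() for law in laws)):
        prior = Fr(1)
        for law, v in zip(laws, vs):
            prior *= law[v]
        # Y_j = V_j * prod_{i<j} (1 - V_i); Y_{k+1} = prod_{i<=k} (1 - V_i).
        y = Fr(1)
        for i in range(min(j - 1, k)):
            y *= 1 - vs[i]
        if j <= k:
            y *= vs[j - 1]
        joint[vs] = prior * y
    norm = sum(joint.values())
    return {vs: w / norm for vs, w in joint.items()}

def predicted_joint(laws, j):
    """Product of the reweighted marginals described in the lead-in."""
    tilted = []
    for i, law in enumerate(laws, start=1):
        if i < j:
            tilted.append(tilt(law, lambda v: 1 - v))
        elif i == j:
            tilted.append(tilt(law, lambda v: v))
        else:
            tilted.append(law)
    out = {}
    for vs in product(*(law.keys() for law in tilted)):
        p = Fr(1)
        for law, v in zip(tilted, vs):
            p *= law[v]
        out[vs] = p
    return out

# Hypothetical laws of V_1, V_2, V_3 (k = 3), supported away from 0 and 1.
laws = [
    {Fr(1, 4): Fr(1, 2), Fr(1, 2): Fr(1, 2)},
    {Fr(1, 3): Fr(1, 4), Fr(2, 3): Fr(3, 4)},
    {Fr(1, 5): Fr(1, 3), Fr(3, 5): Fr(2, 3)},
]
checks = {j: posterior_joint(laws, j) == predicted_joint(laws, j)
          for j in range(1, len(laws) + 2)}
print(checks)
```

Because the likelihood $Y_j$ factorises over the coordinates $V_i$, the posterior stays a product law in this example, which is the independence that complete neutrality requires.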